In higher organisms, all cells share the same genome, but every cell expresses only a
limited and specific set of genes that defines the cell type. During cell division, not only
the genome but also the cell type is inherited by the daughter cells. This intriguing
phenomenon is achieved by a variety of processes that have been collectively termed
epigenetics: the stable and inheritable changes in gene expression patterns. This article
reviews the extremely rich and exquisitely multi-scale physical mechanisms that govern
the biological processes behind the initiation, spreading and inheritance of epigenetic
states. These include not only the changes in the molecular properties associated with
the chemical modifications of DNA and histone proteins, such as methylation and
acetylation, but also less conventional changes, typically in the physics that governs the
three-dimensional organization of the genome in cell nuclei. Strikingly, to achieve
stability and heritability of epigenetic states, cells take advantage of many different
physical principles, such as the universal behavior of polymers and copolymers, the
general features of dynamical systems, and the electrostatic and mechanical properties
related to chemical modifications of DNA and histones. When the complex biological
literature is put under this new light, the emerging picture is that a limited set of
general physical rules plays a key role in initiating, shaping and transmitting this crucial
“epigenetic landscape”. This new perspective not only makes it possible to rationalize
normal cellular functions, but also helps to understand the emergence of pathological
states, in which the epigenetic landscape becomes dysfunctional.
I. INTRODUCTION


A. An intricate history 

The word “epigenetic” was introduced by Waddington in 1942 in the context of
development, to qualify all the processes relating the genotype and the phenotype of an
organism (Waddington, 1942). The associated investigations belonged to the domain,
novel at that time, of developmental genetics. The word “epigenetic” in this original
meaning was shaped by the pre-existing concept of epigenesis, namely the fact that the
organism is not fully formed in the initial cell but undergoes complex developmental
processes (Gilbert, 2012). Epigenetics was at that time the mechanistic study of how
genes guide the epigenesis (development) of an organism, which is captured
metaphorically by Waddington’s famous epigenetic landscape: a landscape, shaped by the
genes, on which the organism evolves during its development like a stone rolling down
the landscape, following one of the possible epigenetic pathways (see Fig. 1).

In parallel, the adjective has been used with the meaning of “para-genetic”. Epigenetic
systems, as opposed to the genetic system, were conceived as “signal interpreting
devices” (Nanney, 1958), i.e. mediators between signals - environmental or
physiological cues - and the genomic response, mainly at the level of transcriptional
regulation.

Due to this dual origin of the word “epigenetic”, the associated concepts have developed
in several ways (see (Haig, 2004) for a detailed historical account) and one could find in
1994 the two following complementary definitions of epigenetics (Holliday, 1994):
(1) changes in gene expression which occur in organisms with differentiated cells, and
the mitotic inheritance of the associated patterns of gene expression; (2)
transgenerational inheritance, that is, transmission through meiosis of non-genomic
information.

Due to this intricate history, a consensus definition of epigenetics is still lacking today
(Dawson and Kouzarides, 2012). Notably, transgenerational inheritance, albeit largely
documented in plants, remains a matter of debate in animals and especially in humans.

Recently, some authors proposed an operational definition of epigenetics: “An
epigenetic trait is a stably heritable phenotype resulting from changes in a chromosome
without alterations in the DNA sequence”. This will be the definition we use in this
review. To be even more specific, we note that:

Epigenetics is the modification of the function(s) of a gene that is stable and heritable
during mitosis, and possibly during meiosis.

Epigenetics is not the reversible regulation of transcription in response to metabolic
cues, because this is neither stable nor heritable.


B. Scope of this review 


In this review, we not only intend to analyze the physics that drives or accompanies
epigenetic marking, but we also aim at understanding the rationale behind this marking.
Physics is a beautiful, yet underrated, guide to reach this goal.

Several epigenetic mechanisms will be distinguished: those occurring at the level of
DNA, those involving histone post-translational modification, and less conventional
ones involving chromatin topology (supercoiling) and nuclear architecture.

We first introduce in Sec. II the physical template of epigenetic marking, namely
chromatin.

Section III is devoted to the physics behind the family of processes by which epigenetic
marks control gene expression in different cell types.

Section IV addresses the issue of the initiation, spreading, maintenance, and heritability
of the epigenetic marks in the framework of dynamical systems.

In Sec. V we review other epigenetic processes that have a less clear-cut physical
interpretation: DNA methylation, imprinting, X chromosome inactivation, and
supercoiling marking. In conclusion, we propose a list of currently significant and
challenging issues.

Due to the fundamentally different logic of transcriptional regulation in prokaryotes and
eukaryotes (Struhl, 1999), we will leave aside the realm of bacteria, although epigenetic
switches have been observed as well in prokaryotic cells and have been modeled
successfully (Lim and Van Oudenaarden, 2007; Norregaard et al., 2013).


We hope this review will be a stimulating introduction to epigenetics for physicists as
well as an “alternative reading frame” of epigenetics for biologists that will help in
tackling cutting-edge advances in current topics ranging from nuclear organization and
cell differentiation up to cancer progression and chronic diseases.


II. THE PHYSICAL TEMPLATE OF EPIGENETICS: 
CHROMATIN 

In all living organisms, DNA encodes the genetic instructions required to synthesize
proteins, the basic bricks ensuring the proper functioning of the cell. The main steps of
protein synthesis are DNA transcription into RNA, then RNA translation into an amino
acid chain and chain folding to form a functional protein (Alberts et al.).

FIG. 1 (Left) The epigenetic landscape described by Waddington (Waddington, 1957)
represents the process by which the cell (represented by the ball) faces different possible
paths during development (i.e. chooses one of the permitted trajectories), leading to
different cell fates. (Right) The landscape is dynamically determined by hidden wires
that symbolize gene expression and interactions. See Sec. IV.D and Sec. VI for a physical
reanalysis of this metaphorical picture. Adapted from (Waddington, 1957).


The very same genome is found in each cell. It has to be packaged inside its tiny volume,
and has to be retrieved at will for physiological purposes. DNA is therefore embedded in
an orderly and dynamically retrievable architecture. Two main organizational strategies
can be identified. In prokaryotes (bacteria), DNA is located in the same compartment as
all other intracellular components. In eukaryotes (from the unicellular yeast up to
multicellular organisms, including fungi, animals and plants), DNA is sequestered in the
nucleus, a dedicated compartment enclosed within a membrane.

In the cell nucleus, multiple long linear DNA molecules are organized by architectural
proteins to form chromosomes. From a physicist's point of view, chromosomes are giant
polymers. During mitosis, i.e. cell division, chromosomes duplicate and then condense
into the well-known “X” shape, with each DNA copy forming one of the two rods (the
sister chromatids, bound together at the centromere). The rest of the time (i.e. during
interphase), chromosomes are less condensed and fill the whole nucleus, more or less
homogeneously (Leblond and El-Alfy, 1998). To give a quantitative idea of the
composition of an interphase nucleus, the dry matter of a yeast nucleus is about 70-80%
protein, 20-30% RNA, and only 2% DNA (Rozijn and Tonino, 1964).


In this section we introduce the basic concepts that come into play in the study of
epigenetics. In Sec. II.A we give an overview of the molecular structure of chromatin,
and we introduce the concept of epigenetic marks. In Sec. II.B we give an overview of
the large-scale organization of chromatin in the cell nucleus, stressing the importance of
this organization in gene expression. Finally, in Sec. II.C we give a synthetic picture of
these two aspects, in the framework of polymer physics.


A. Molecular picture of chromatin and its modifications 

In eukaryotic organisms, chromosomal DNA is associated with proteins to form
chromatin. The principal proteins associated with DNA are called histones. Histones are
polypeptide monomers of five types: the histone 1 (H1) class, the histone 2A (H2A)
class, the histone 2B (H2B) class, the histone 3 (H3) class and the histone 4 (H4) class.
Each histone family has variants whose presence in chromatin depends on the species,
the cell type, and the development stage. The classical structure of the histone-DNA
assembly consists of 1.7 left-handed turns of double-stranded DNA (approximately 147
base pairs or bp) wrapped around a histone octamer composed of two copies of each
histone monomer H2A, H2B, H3 and H4 (Davey et al., 2002; Luger et al., 1997). In
most species, this assembly, referred to as the nucleosome core particle (NCP) (see
Fig. 2a,b), may also integrate a copy of H1 (linker histone) at the DNA entry/exit point,
although H1 does not share the ubiquity of the other histone classes.

FIG. 2 Detailed structure of the Nucleosome Core Particle (NCP) and chromatin fiber.
(a) Left: internal tertiary structure of the NCP147 (PDB: 1KX5). H2A is in green, H2B
in blue, H3 in yellow, H4 in red, and DNA in black. Right: electrostatic potential at the
NCP surface (computed using the PDB2PQR/APBS plugin (Baker et al., 2001; Dolinsky
et al., 2007) of the Visual Molecular Dynamics (www.ks.uiuc.edu/Research/vmd)
software package (Humphrey et al., 1996), ionic strength 0.15 M monovalent salt).
(b) Cartoon of the NCP showing histone tails and the globular core. Lysine and arginine
residues are marked by asterisks (from (Wolffe and Hayes, 1999)). (c) Electron
microscopy images of nucleosome arrays at low (left) or high (far right) ionic strength,
with details of a nucleosome (middle), from (Olins and Olins, 2003). (d) Principles of
reconstituted chromatin sedimentation assays, adapted from (Pepenella et al., 2013).

In addition, consecutive NCPs are separated by linker DNA whose length ranges from 20
to 60 bp. Chromosomes are thus a succession of NCPs and DNA linkers. The basic
structural unit (monomer) is made of one NCP and one DNA linker, and is called the
nucleosome. The number of DNA base pairs inside one nucleosome is the Nucleosome
Repeat Length (NRL) (see Fig. 2d), which is not constant and may vary along the
genome and across various tissues.
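
As a rough numerical illustration of the nucleosome repeat length, the sketch below combines the 147 bp wrapped in the NCP with the 20 to 60 bp linker range quoted above. The function names and the 10 Mb genome size are illustrative choices, and the count assumes, as a simplification, that the whole genome is uniformly covered by nucleosomes with a single linker length.

```python
NCP_BP = 147                 # DNA wrapped in one nucleosome core particle (from the text)
LINKER_RANGE_BP = (20, 60)   # linker length range quoted in the text


def nucleosome_repeat_length(linker_bp):
    """NRL = DNA wrapped in the NCP plus one linker, in base pairs."""
    return NCP_BP + linker_bp


def nucleosome_count(genome_bp, linker_bp):
    """Number of whole nucleosomes, assuming uniform coverage of the genome."""
    return genome_bp // nucleosome_repeat_length(linker_bp)


nrl_min, nrl_max = (nucleosome_repeat_length(l) for l in LINKER_RANGE_BP)
print(f"NRL range: {nrl_min}-{nrl_max} bp")

# Hypothetical genome size chosen for illustration only.
genome_bp = 10_000_000
for linker in LINKER_RANGE_BP:
    print(f"linker {linker} bp -> about {nucleosome_count(genome_bp, linker)} nucleosomes")
```

Under these assumptions the NRL spans 167 to 207 bp, so a change in linker length alone shifts the number of monomers in the chromatin polymer by roughly 20%.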

Electrostatic interactions are important because the NCP has a charge of -150e, to
which DNA contributes -294e and histones +144e. The NCP is therefore not electrically
neutral, so the folding of nucleosome arrays is highly dependent on the presence of
positive counterions (Bertin et al., 2007b; Yang and Hayes, 2011). Additionally, the
charge distribution in the NCP is not spatially homogeneous (see Fig. 2a).
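
The charge budget quoted above can be checked with a few lines of arithmetic. The sketch below assumes one elementary negative charge per phosphate, i.e. two per base pair, which reproduces the -294e DNA contribution for 147 bp; the histone contribution is taken directly from the text, and the variable names are illustrative.

```python
# Charge bookkeeping for one nucleosome core particle (NCP), in units of e.
BP_PER_NCP = 147        # base pairs wrapped around the octamer (from the text)
PHOSPHATES_PER_BP = 2   # assumption: one charged phosphate per nucleotide
HISTONE_CHARGE = 144    # net octamer charge quoted in the text


def ncp_charges(bp=BP_PER_NCP, histone_charge=HISTONE_CHARGE):
    """Return (DNA charge, net NCP charge) in units of e."""
    dna = -PHOSPHATES_PER_BP * bp
    return dna, dna + histone_charge


dna_charge, net_charge = ncp_charges()
# Fraction of the DNA charge compensated by the histone octamer.
neutralized_fraction = HISTONE_CHARGE / -dna_charge
print(dna_charge, net_charge, round(neutralized_fraction, 2))
```

The histones thus compensate only about half of the DNA charge, which is why counterions are needed to fold nucleosome arrays.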

Epigenetic marks are chemical covalent modifications of either DNA (namely DNA
methylation, see Sec. V.A) or histones (so-called post-translational modifications,
PTMs, see Sec. III.B). The DNA methylation state and the histone PTMs are transmitted
through cell division both because they are covalent and thanks to specific mechanisms.
DNA methylation is accurately transmitted by a specific molecular mechanism (see
Sec. V.A). Histone PTMs are inherited in a fundamentally different way, which will be
the principal subject of Sec. IV.


B. Large-scale picture of chromatin 


Eukaryotic chromosomes are giant polymers, each formed by a huge string of
nucleosomes. The conformation of this string at different length scales is generally
described using an analogy with proteins: the string of nucleosomes itself can be viewed
as the primary structure of chromatin; the conformation adopted by an array of a few
dozen successive nucleosomes forms the secondary structure of chromatin. The 3D
structural arrangement of several arrays can finally be viewed as the chromatin tertiary
structure (Luger et al., 2012; Pepenella et al., 2013), see Fig. 2.

When observed by electron microscopy (see Fig. 3a,b,c), interphase chromatin appears
to fill the entire nucleus volume. As genome length may vary considerably from
organism to organism, the nucleus size varies accordingly: orders of magnitude go from
~10 Mb (mega base pairs) for a diameter of the order of 2 μm in yeast (Fig. 3a), to
~100 Mb and 4 μm in the drosophila fly (Fig. 3b), and up to ~1000 Mb and 10 μm in
mammals (Fig. 3c). These differences in size are certainly correlated with the differences
in chromatin organization that can be directly deduced by simple inspection of electron
microscopy images.
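
To put the orders of magnitude above on a common footing, the following sketch converts them into a mean DNA density. It assumes a spherical nucleus and uses the rounded genome sizes and diameters quoted in the text, so the results are indicative only; the function name is illustrative.

```python
import math

# (genome size in Mb, nuclear diameter in micrometers): orders of magnitude from the text
NUCLEI = {"yeast": (10, 2.0), "drosophila": (100, 4.0), "mammal": (1000, 10.0)}


def dna_density(genome_mb, diameter_um):
    """Mean DNA density in Mb per cubic micrometer, assuming a spherical nucleus."""
    volume = (4.0 / 3.0) * math.pi * (diameter_um / 2.0) ** 3
    return genome_mb / volume


densities = {name: dna_density(*values) for name, values in NUCLEI.items()}
for name, rho in densities.items():
    print(f"{name}: {rho:.1f} Mb/um^3")
```

With these rounded inputs the genome size spans two orders of magnitude while the mean density stays within a factor of two, which suggests that the differences in nuclear organization between these organisms lie in how chromatin is distributed inside the nucleus rather than in its average concentration.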

Yeast nuclei are the most homogeneously filled ones, with a large, denser region called
the nucleolus, which is known to be the site of very intense ribosomal RNA synthesis. A
smaller, dark linear body can also be seen in the inset, connected with a star-shaped
structure, the spindle pole body (SPB), from which tubular protein assemblies, the
microtubules, stem and “hold” chromosomes at their centromeres. In contrast with
multicellular organisms, in yeast this microtubule bundle is preserved throughout the
cell cycle. It is a crucial organization center for the assembly of chromosomes in
interphase and for chromosome segregation during mitosis (see Fig. 3a,d,g).

When the nuclei of multicellular organisms are considered (Fig. 3b,c), their most
striking feature is the coexistence of distinct denser and less compact regions. These
regions are persistent and are not simply the result of temporal fluctuations of chromatin
density. These features have been shown to strongly correlate with the transcription
activity of genes. Active genes tend indeed to gather at the center of the nucleus, in a
region where chromatin is less dense and more accessible, which is called euchromatin.
Inactive genes are found instead in denser regions, called heterochromatin, and tend to
associate with the nuclear periphery. As a stunning example of chromatin compaction
and localization changes induced by transcription, the activation of a genomic locus
results in a dramatic change of its topology (Fig. 4a).

With the improvement of imaging and labeling techniques, gene transcription by the
RNA polymerase PolII in multicellular organisms has been shown to occur in
well-defined loci, called factories (Fig. 4b) (Jackson et al., 1993). These factories are
located within the euchromatin domain and each factory has a propensity to gather
co-regulated genes (Jackson et al., 1998). In this picture, it appears that the functional
differences between cell types are related to the way the genome is folded in the nucleus
of these cells.

In the last two decades, impressive advances have been made in experimental techniques
for measuring 3D chromosomal contacts, starting from the “Chromosome conformation
capture” approach (Dekker et al., 2002). Its genome-wide derivative (Hi-C) enables the
generation of contact maps at the genome scale (Lieberman-Aiden et al., 2009). From
these maps, it is possible to reconstruct the underlying 3D structure of the genome, and
such structures are represented in Fig. 3d,e,f. The results confirmed the tethering of
centromeres in yeast and drosophila, but also of telomeres. In humans, the
reconstruction of chromosome 1 gives a visual illustration of decondensed euchromatin
loops emanating from heterochromatin globules. These are decorated with two different
specific histone marks. We will come back to the results of these investigations later in
this review.
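
To make concrete what a contact map encodes, the sketch below computes one from a known set of 3D bead coordinates. This is only the forward direction: the reconstructions discussed above solve the much harder inverse problem, which is not attempted here. The bead coordinates, the cutoff and the function name are hypothetical, and real Hi-C maps contain contact frequencies averaged over many cells rather than binary contacts.

```python
import math


def contact_map(coords, cutoff):
    """Return a symmetric 0/1 matrix: 1 where two beads lie within `cutoff` of each other."""
    n = len(coords)
    cmap = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            if math.dist(coords[i], coords[j]) <= cutoff:
                cmap[i][j] = cmap[j][i] = 1
    return cmap


# Hypothetical toy conformation: a 6-bead chain folded into a hairpin (arbitrary units).
hairpin = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (2, 1, 0), (1, 1, 0), (0, 1, 0)]
cmap = contact_map(hairpin, cutoff=1.0)
for row in cmap:
    print(row)
```

In this toy example the two ends of the chain (beads 0 and 5) are in contact although they are far apart along the sequence: such off-diagonal entries are the signature of folding that reconstruction methods exploit.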


C. Chromosomes as polymers 


Most of the modeling efforts addressing the question of nuclear organization have so far
been guided by polymer physics. The question then arises whether polymer physics is
the main player that drives chromosome organization.

(i) In the simplest case of yeast, where chromosomes are shorter and all anchored at the
SPB by their centromeres, it seems to be the case. Indeed, several polymer simulations
have been able to reproduce the structure of interphase yeast nuclei (Tjong et al., 2012;
Wong et al., 2012), see Fig. 3g. Moreover, fluorescence microscopy has been used to
check the dynamical behavior in vivo of given chromosomal loci (Albert et al., 2013;
Hajjoul et al., 2013). Single particle tracking has revealed a quite uniform response
within the genome, characteristic of polymers in confined spaces. Except for telomeres
and for the highly transcribed DNA in the nucleolus, yeast chromosomes behave as a
polymer brush, and are essentially organized by simple physical principles (Huet et al.,
2014) (see Fig. 3g).

Note that, in yeast, the whole genome is actively transcribing most of the time, with the
only exception of the regions that govern the cell sexual behavior, called “hidden mating
type loci”, and of the chromosome extremities, called telomeres, which protect the ends
of the chromosome from damage or from fusion with other chromosomes. Therefore
heterochromatin is restricted to telomeres and mating type loci in this case.

(ii) In the well-studied, intermediate-size case of the drosophila, recent investigations
tend to indicate that this polymer behavior is partially conserved, but with some
significant changes that complicate the picture (see Fig. 3e,h). Roughly speaking, it has
been proposed that euchromatin and heterochromatin have intrinsically different
biochemical and physical properties, due to a deeply different protein “dressing” of the
DNA molecules. More precisely, Filion and co-workers have identified five principal
chromatin states, called chromatin “colors”, from the analysis of 53 chromatin protein
genome-binding profiles in drosophila cells (Filion et al., 2010). Among these states,
some essentially correspond to active, transcribing euchromatin, others to dense,
repressed heterochromatin. These chromatin states result from the recruitment of
DNA-binding proteins that are specific to the underlying epigenetic marks (see
Sec. III.B). Note, however, that the colors are only an arbitrary choice of the authors and
do not correspond to actual coloring of the chromatin, due, e.g., to staining.

As a consequence, drosophila chromosomes are more properly described as co-polymers,
i.e. polymers containing more than one type of monomer. A model of the resulting
copolymer brush is depicted in Fig. 3h.

(iii) In mammals, heterochromatin is mainly located at the nuclear membrane and
euchromatin at the center of the nucleus (see Fig. 3c). The reconstituted 3D structure of
chromosome 1 (the longest human chromosome) shows an alternation of long loops of
euchromatin and dense parts of heterochromatin tethered to the nuclear membrane (see
Fig. 3f).

In summary, the conformation adopted by chromatin is affected by its intrinsic
structural parameters such as the NRL (the reader may find an extensive review in
(Boule et al.)), on top of which lies an additional layer of modulation by
internucleosomal electrostatic interactions (Hansen, 2002; Pepenella et al., 2013) and
binding of architectural proteins. This conformation is essential for gene regulation. The
epigenetic marks present on DNA and histones, by mediating specific interactions
between portions of chromatin, alter its conformation and hence its function. The next
section will be devoted to understanding the complex relationship between epigenetic
marking and genome structure and function.

FIG. 3 Nuclear organization in yeast (S. cerevisiae), drosophila and mammals. (a-c)
Electron microscopy images of nuclei in yeast (a), in drosophila (b) and in human (c).
Scale bars correspond respectively to 1 μm, 2 μm and 2 μm. In (a), the nucleolus is the
darker region in the upper part of the nucleus. The SPB is shown by a circle. In (b), the
nucleolus is the dark circular region and heterochromatin can be seen as darker spots. In
(c), the nucleolus is marked by a dashed circle. (Figure (a) from
http://scienceblogs.com/transcript/2006/08/16/the-centrosome-and-the-spindle/, (b)
from http://pixgood.com/nuclear-pore-em.html, (c) from http://tinyurl.com/m6phpf8).
(d-f) 3D model reconstructions (Lesne et al., 2014) from contact maps obtained using
the Hi-C protocol in these three organisms (Dixon et al., 2012; Duan et al., 2010; Sexton
et al., 2012). In (d) and (e) all the chromosomes are represented with different colors. In
(f), only chromosome 1 is shown and the colors correspond to regions harboring
different epigenetic marks: H3K9Ac and H3K9me3 (see Sec. III.B). On each
reconstruction, centromeres are shown as black beads and telomeres (chromosome
ends) as purple beads. Each bead represents respectively 12 kb, 40 kb and 40 kb. (g-h)
Polymer models of the genome in yeast and drosophila: in (g) each chromosome is
labeled with a different color (from (Wong et al., 2012)). In (h) colors correspond to the
colors of chromatin (see Sec. III.C; courtesy of Giacomo Cavalli and Pascal Carrivain
(IGH, Montpellier)). (i) To our knowledge, physical models of the human genome have
not been developed so far.

FIG. 4 Fluorescence microscopy of nuclei. (a) A specific region of the genome, labeled in
red, is decompacted after the induction of transcription (adapted from (Tumbar et al.,
1999)). (b) A human cell nucleus. DNA is labeled in blue, PolII in red (from (Crepaldi et
al., 2013)).


III. FROM EPIGENETIC MARKS TO REGULATION OF 
GENE EXPRESSION THROUGH THE 3D 
ORGANIZATION OF THE GENOME 

A. General principles of gene silencing. The paradigm of 
DNA accessibility 

During development, the determination of the cell type (cell fate) involves progressive
restrictions in its developmental potency and results from differential gene expression.
DNA methylation is a key control parameter of this process: genes that are specific for
the desired tissue are kept unmethylated, whereas the others are methylated. Moreover,
patterns of DNA methylation are faithfully propagated throughout successive cell
divisions (see Sec. V.A). However, the physics of DNA methylation is still elusive and we
therefore postpone further developments on DNA methylation to the last part of this
review (see Sec. V.A).

Epigenetic regulation of gene expression involves silencing, i.e. a permanent and
heritable inhibition of gene transcription (transcriptional gene silencing) or translation
(post-transcriptional gene silencing). The current paradigm is that gene silencing is
achieved through chromatin condensation, in a so-called heterochromatinization process
(Grewal and Moazed, 2003). Can we characterize the physical properties of
heterochromatin and euchromatin? What are the physical consequences of
heterochromatinization in terms of structure and dynamics, and how do these physical
consequences translate into functional consequences?

Histones play a crucial role in determining the structure of chromatin and, at the same
time, are the substrate of a vast catalog of epigenetic markings (Cantone and Fisher,
2013; Kouzarides, 2007). This is not a coincidence. It supports the hypothesis that
epigenetic histone marks modulate gene expression through chromatin structural
rearrangements at each level of the nuclear organization: nucleosome, chromatin fiber,
chromatin loops, chromosome territories, whole nucleus (Poirier et al., 2009; Zhou et
al., 2007).


The potentially huge number of combinations of epigenetic marks has led to the
hypothesis that transcription factors (TFs) might be targeted to nucleosomes endowed
with specific combinations of histone tail post-translational modifications. This was
coined the “histone code” hypothesis at the beginning of the millennium and remained
popular in biology for the better part of a decade. While the genetic code encodes the
sequence of a protein using the four bases of the DNA, this histone code would encode
the regulatory events involved in triggering its expression, and in particular the
differential expression observed in cells of different types (Jenuwein and Allis, 2001).
However, it was then realized that only a few combinations of histone marks are actually
observed (Lennartsson and Ekwall, 2009; Rando, 2012). These relevant combinations
can even be reduced to 5 classes in the case of drosophila, the so-called “five colors of
chromatin” identified by Filion et al. (Filion et al., 2010). Moreover, it was also realized
that the histone code is actually not a genuine code insofar as the correspondence
between the codewords (combinations of epigenetic marks) and their meaning (the TFs
they code for) is not gratuitous (Kühn and Hofmeyr, 2014; Lesne, 2006). Instead, the
correspondence is a biochemical recognition of the pattern of histone marks by the TF.
This is why it was recently proposed that chromatin may have evolved as an allosteric
enzyme able to mediate a gratuitous correspondence between epigenetic marks and TF
binding to the underlying DNA sequence (Lesne et al., 2015).
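
The gap between the “potentially huge number of combinations” and the handful of states actually observed can be illustrated with a simple count. The sketch below assumes, purely for illustration, that each modifiable site is either marked or unmarked and that sites are independent; the numbers of sites are hypothetical and are not taken from the literature.

```python
def n_mark_combinations(n_sites, states_per_site=2):
    """Number of distinct mark patterns if every site were set independently."""
    return states_per_site ** n_sites


N_OBSERVED_STATES = 5  # the "five colors of chromatin" (Filion et al., 2010)

for n_sites in (10, 20, 30):  # hypothetical numbers of modifiable sites
    total = n_mark_combinations(n_sites)
    print(f"{n_sites} sites: {total} patterns, "
          f"observed fraction {N_OBSERVED_STATES / total:.1e}")
```

Even with only ten binary sites there would be about a thousand possible patterns, so the reduction to five chromatin states implies that marks are strongly correlated rather than combined freely.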


B. Histone modifications as chromatin structural 
modulators 

Most epigenetic marking occurs on the histones that 
coat DNA. What are the physical consequences of this 
marking and what is its effect on chromatin organization? 


1. Histone tails and their role in internucleosomal interactions 

As already mentioned, nucleosomes are formed by wrapping DNA around an octameric
protein assembly formed by histone proteins. The N-terminal sequences of H2A, H3 and
H4 extend from the globular histone core to form the so-called histone tails (see
Fig. 2b). The H3 and H4 tails consist respectively of 35 and 20 residues, of which
respectively 13 and 9 are positively charged (lysines, K, and arginines, R). These tails
are intrinsically disordered protein domains, hence adopt a random coil configuration,
as suggested by crystallographic studies (Davey et al., 2002; Luger et al., 1997) and
proteolytic cleavage assays. Tails contribute differently to intranucleosomal stability and
internucleosomal interactions (Allan et al., 1982; Arya and Schlick, 2006, 2009; Sinha
and Shogren-Knaak, 2010; Zhou et al., 2007). The two H3 tails exit from the histone
core close to the DNA entry-exit site of the nucleosome, and associate preferentially with
DNA to “lock” its wrapping around the histone core. The H4 tails are known to
associate with a set of seven residues referred to as the H2A/H2B acidic patch, located
on the H2A-H2B interface (see Fig. 2). An H4 tail on one nucleosome may interact with
an H2A/H2B acidic patch on an adjacent nucleosome, acting as a tether connecting the
two nucleosomes (Kalashnikova et al., 2013; Kan et al., 2009). The H2A and H2B tails,
much shorter than their H3 and H4 counterparts and the subject of a much smaller
literature, do not seem to significantly contribute to internucleosome interactions,
although they are required for proper nucleosome reconstitution (Bertin et al., 2007a).
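
The tail compositions quoted above translate into a high density of positive charge. The sketch below only counts the lysine and arginine residues given in the text; it ignores any negatively charged residues and any post-translational modification, and the factor of two reflects the two copies of each core histone in the octamer. Variable and function names are illustrative.

```python
# (tail length in residues, positively charged residues): values quoted in the text
TAILS = {"H3": (35, 13), "H4": (20, 9)}
COPIES_PER_OCTAMER = 2  # two copies of each core histone per nucleosome


def charged_fraction(length, positive):
    """Fraction of tail residues that are lysine or arginine."""
    return positive / length


fractions = {name: charged_fraction(*values) for name, values in TAILS.items()}
tail_positive_total = COPIES_PER_OCTAMER * sum(pos for _, pos in TAILS.values())

for name, frac in fractions.items():
    print(f"{name} tail: {frac:.0%} of residues positively charged")
print(f"H3 + H4 tails: {tail_positive_total} positive residues per nucleosome")
```

Roughly 40% of the residues in these tails are positively charged, which is consistent with their affinity for DNA and for the acidic patch described above.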


2. Histone tail post-translational modifications (PTMs) 


Histone tails, besides their role in the structuring of nucleosome arrays, are also the
support of virtually all PTMs targeting histones, which consist in replacing a group of
atoms on one residue by another, chemically different one (see Fig. 5). For a historical
account of their discovery, see (Morange, 2013). The globular histone core and the
lateral surface of the nucleosome may also undergo post-translational modifications,
which modulate the nucleosome stability and DNA wrapping (Tessarz and Kouzarides,
2014; Tropberger and Schneider, 2013), hence chromatin architecture. The repertoire of
histone-tail PTMs is vast both in terms of types of modifications and in terms of where
the modification can take place (Fierz, 2014; Pepenella et al., 2013; Henikoff, 2013).


In order to reach a comprehensive physical picture we oversimplify the daunting
complexity of epigenetic histone PTMs (Kouzarides, 2007) to focus here on:

(i) lysine methylation and notably the two main histone PTMs that are involved in gene
silencing: tri-methylation of the lysine 9 of H3, noted H3K9me3, which recruits HP1,
and tri-methylation of the lysine 27 of H3, noted H3K27me3, which recruits the
Polycomb architectural complex;

(ii) lysine acetylation and specifically the acetylation of lysine 16 of H4, H4K16ac,
which is a hallmark of active chromatin (actively expressed genes).
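
The compact notation used for these marks (histone, residue, position, modification) lends itself to a small parser. The sketch below is an illustration rather than a standard tool: it only covers the lysine methylation and acetylation marks discussed in this review, the class and function names are hypothetical, and the charge rule follows the statement that acetylation removes the positive charge of lysine while methylation preserves it.

```python
import re
from typing import NamedTuple


class HistoneMark(NamedTuple):
    histone: str       # e.g. "H3"
    residue: str       # one-letter amino acid code, e.g. "K" for lysine
    position: int      # residue index along the histone sequence
    modification: str  # "me1", "me2", "me3" or "ac"


_MARK_RE = re.compile(r"^(H(?:1|2A|2B|3|4))([A-Z])(\d+)(me[123]|ac)$", re.IGNORECASE)


def parse_mark(name):
    """Split a mark name such as 'H3K9me3' into its components."""
    match = _MARK_RE.match(name)
    if match is None:
        raise ValueError(f"unrecognized histone mark: {name!r}")
    histone, residue, position, modification = match.groups()
    return HistoneMark(histone.upper(), residue.upper(), int(position), modification.lower())


def lysine_charge_change(mark):
    """Charge change (units of e) relative to unmodified lysine at biological pH."""
    if mark.residue != "K":
        raise ValueError("only lysine modifications are covered in this sketch")
    return -1 if mark.modification == "ac" else 0


for name in ("H3K9me3", "H3K27me3", "H4K16ac"):
    mark = parse_mark(name)
    print(name, mark, lysine_charge_change(mark))
```

Separating the parsing from the charge rule keeps the second function easy to extend if other residues or modifications need to be covered.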

Epigenetic marks are deposited on or removed from histone tails by dedicated enzymes,
so-called “writers” and “erasers” (Fierz, 2014). Writers devoted to acetylation are
histone acetyltransferases (HAT), notably lysine acetyltransferases (KAT), and writers
devoted to methylation are histone methyltransferases (HMT), notably lysine
methyltransferases (KMT). Erasers are histone deacetylases (HDAC) and histone
demethylases (HDM), notably lysine demethylases (KDM), see Fig. 5. A comprehensive
list of the known writers and erasers is given in the accompanying table.

A wealth of data exists regarding the presence of
histone tail modifications in different species,
development stages and cell types (the so-called epigenome),
but efforts to characterize the effect of histone PTMs
are currently limited by the difficulty of examining in
vivo chromatin structure. Interestingly, the two main
modifications discussed here, lysine acetylation and
lysine methylation, seem to act on the chromatin
architecture and state of activity through rather
different mechanisms. In the case of acetylation, a direct
effect on nucleosome-nucleosome interactions is at play,
with a definite but subtle relationship with the
associated loss of a positive charge (see Sec. III.B.3). In
contrast, methylation preserves electric charges, while
introducing significant steric hindrance and potentially
hydrophobic interactions, and mainly acts on chromatin
indirectly by recruiting additional architectural proteins
(see Sec. III.B.5). For this reason, acetylation
mechanisms are more easily studied by in vitro experiments,
while methylation effects are more generally studied in
the in vivo context, in the presence of their multiple
partners. We will now sum up some of the main
experimental results and theoretical interpretations concerning
both these PTMs.

[Figure 5 labels: lysine; monomethyl-, dimethyl- and trimethyl-lysine;
acetyl-lysine; histone methyltransferase (HMT), with SAM and SAH;
histone demethylase (KDM); histone/lysine acetyltransferase (HAT, KAT),
with ac-coA and coA; histone deacetylase (HDAC).]

FIG. 5 Chemical formulas of unmodified, acetylated, and mono-, di-, tri-methylated lysine residues. Methylation (from lysine
to the left) is achieved by successive additions of single methyl groups, using the SAM metabolite as a source of methyl groups.
Acetylation (from lysine to the right) is achieved using the acetyl-coA cofactor as a source of acetyl groups. Acetylation reduces
the charge at biological pH, whereas methylation preserves the charge.


3. Histone tail acetylation: direct effects on chromatin
accessibility

a. Experiments Experimental studies of the role of
histone tail acetylation in the architecture of nucleosomal
arrays are conducted using reconstituted, in vitro
chromatin. In this approach, nucleosomes are reconstituted
by incorporating recombinant histones with tailored
amino acid sequences on tandem repeats of a DNA
sequence with very high histone affinity (the so-called
“601 sequence”). The sedimentation coefficient of such
arrays is then measured as a proxy for their folding
propensity, comparing the sedimentation coefficients of
arrays with or without combinations of histone tail
acetylation (Allahverdi et al., 2011; Liu et al., 2011;
Shogren-Knaak et al., 2006; Wang and Hayes, 2008). In
addition, small-angle X-ray scattering assays on folded
nucleosome arrays give estimations of internucleosome
interaction energies (Bertin et al., 2007c; Howell et al., 2013).
Taken together, these studies show that H4 tail
acetylation decreases internucleosomal intra-array associations
(Hizume et al., 2010).

Acetylation of lysine 16 of histone H4 (H4K16ac) has
the strongest effect in this regard, and may lead to
massive disruption of dense chromatin fibers in vitro
(Shogren-Knaak et al., 2006). Structural effects of
H4K16 acetylation on chromatin compaction are also
confirmed by the observation of a weakening of chromatin
packing in vivo (Shahbazian and Grunstein, 2007), and
are in general associated with actively transcribed genes
(e.g., Taylor et al., 2013).

Surprisingly enough, histone H3 acetylation, which
also reduces the charge of the tails, does not seem to
modify the folding propensity of nucleosome arrays (Wang
and Hayes, 2008), pointing to a specific mechanism of
H4K16 acetylation.


b. Models Experimental studies are often combined
with computational models to provide deeper insights into
how the electrostatic nature of histone tail PTMs
influences chromatin folding.

Potoyan & Papoian (Potoyan and Papoian, 2012)
addressed the question of the decompaction induced by
H4K16 acetylation, and carried out all-atom simulations
in explicit solvent to compare the conformation of the H4
tail with and without this modification. For the
isolated histone tails, H4K16ac leads to slightly more
compact and significantly more structured globular H4 tails.
At this level, compaction is not surprising since the net

TABLE I Principal histone lysine acetylations and methylations of the human epigenome and their writers/erasers (adapted
from (Fierz, 2014; Kouzarides, 2007; Pepenella et al., 2013; Sapountzi and Côté, 2011; Yang and Seto, 2007; Zentner and
Henikoff, 2013)). Other PTMs not shown include arginine methylation and acetylation, serine/threonine phosphorylation,
ubiquitination, and crotonylation (see (Sadakierska-Chudy and Filip, 2014)).

Type                  Family/Class          Proteins/Complexes    Targets
--------------------  --------------------  --------------------  -----------------------------------------------
Acetyltransferases    GNAT                  HAT1                  H4(K5, K12)
(HAT/KAT)             GNAT                  PCAF                  H3(K9, K14, K18)
                      GNAT                  SAGA                  H3
                      CBP/p300              CBP/P300              H3(K14, K18), H4(K5, K8), H2AK5, H2B(K12, K15)
                      MYST                  TIP60                 H4(K5, K8, K12, K16), H3K14
                      MYST                  HBO1                  H4(K5, K8, K12)
                      MYST                  MOZ, MORF, MOF        H4

Deacetylases          Class I               HDAC1-3, 8            H3, H4
(HDAC)                Class IIA             HDAC4, 5, 7, 9        H3, H4
                      Class IIB             HDAC6, 10             H3, H4
                      Class III (sirtuins)  SIRT1-7               H3, H4, H4K16
                      Class IV              HDAC11                H3, H4
                      none                  SIN3-HDAC1,2          H3, H4
                      none                  N-COR/SMRT-HDAC3      H4

Methyltransferases    SET                   MLL1-5                H3K4
(HMT/KMT)             SET                   SET1A, SET1B          H3K4
                      SET                   G9A/GLP               H3K9
                      SET                   SETDB1                H3K9
                      SET                   SUV39H1, SUV39H2      H3K9
                      SET                   EZH2 (PRC2)           H3K27
                      SET                   NSD1                  H3K36
                      SET                   SET2                  H3K36
                      SET                   SUV420H1, SUV420H2    H4K20
                      none                  DOT1L                 H3K79

Demethylases          LSD1                  BHC110                H3K4
(HDM/KDM)             LSD1                  COREST-LSD1           H3K4
                      JmjC                  JHDM1A, JHDM1B        H3K36
                      JmjC                  JHDM2A, JHDM2B        H3K9
                      JmjC                  JMJD2A, JHDM3A        H3(K9, K36)
                      JmjC                  JMJD2B                H3K9


charge reduction weakens self-repulsion between the tail
residues. When DNA is present, i.e. when the entire
nucleosome is considered, tails have a similar
behavior: acetylated tails are more compact, fluctuate less,
and are more frequently bound to their own nucleosomal
DNA. However, the less charged acetylated tail interacts
much more strongly ($\sim 5$-$6\,k_BT$) with DNA than the
unmodified one ($\sim 2\,k_BT$), in contrast to what is
expected on electrostatic grounds. This counterintuitive
effect is achieved thanks to an important tail
reorganization that brings other lysines closer to DNA. While
the overall electrostatic attraction is basically unchanged,
the collapse of the tail is favored by hydrophobic
interaction and entropic gain. In contrast, unmodified H4
tails are more extended and flexible. They showed a
preferential interaction with linker DNA (Angelov et al.,
2001) and with an acidic patch exposed on the surface
of the H2A/H2B dimers of neighboring nucleosomes
(Zhou et al., 2007) (see Fig. 6). Hence, while
unmodified H4 tails may contribute to the
nucleosome-nucleosome attractive interaction by the so-called “tail
bridging” effect (Mühlbacher et al., 2006), the
acetylation of lysine 16 might oppose this effect, leading to
weakened nucleosome-nucleosome interactions (Potoyan and
Papoian, 2012) (see Fig. 6b,c,d). Of note, this is
qualitatively consistent with experiments on the disordered
C-terminal tail of the p53 protein, where a significant
increase of its site-specific DNA binding is observed both
in vitro and in vivo when acetylated (Luo et al., 2004).

Other computational models generally rely on
coarse-grained approximations of the nucleosome core particle
and linker DNA which integrate the mechanical dynamics
of the nucleosome as well as its distribution of charges. Arya
& Schlick used their Discrete Surface Charge
Optimization framework to estimate the contribution
of tails to electrostatic interaction energies, showing that
H3 tails principally screen the negative charge of linker
DNA, while H4 tails mediate internucleosomal
interactions (Arya and Schlick, 2006, 2009), in agreement with
previous experimental findings. However, these studies
do not compare interaction energies with or without
histone PTMs. Several other coarse-grained models have
been used so far to specifically investigate histone tail
acetylation (Allahverdi et al., 2011; Liu et al., 2011; Yang
et al., 2009), showing that the effect of PTMs also largely
depends on the valency and the concentration of bulk
counterions, consistent with sedimentation assays.

[Figure 6 labels: H2A/H2B acidic patch; H3 tails; H4 tails; acetylation;
(tri)methylation; H4K16ac; global acetylation level; H3 acetylation;
H3K9me3 and HP1; H3K27me3 and Polycomb.]

FIG. 6 Nucleosome arrays and histone PTMs. (a) Cartoon of a nucleosome core particle (histone core in yellow, DNA in blue).
(b-d) Examples of chromatin structural modulation by histone PTMs. Right: H4K16 acetylation induces the collapse of the
N-terminal H4 tails on their own nucleosomal DNA, preventing them from binding the H2A-H2B acidic patches of adjacent
nucleosomes. Middle: H3 methylation recruits chromatin associated proteins to form heterochromatin (e.g. H3K9me3 and
HP1, H3K27me3 and the Polycomb family complexes). Left: acetylation of H3 tails decreases their affinity for nucleosomal or
linker DNA and reduces the electrostatic screening of DNA negative charges, leading to changes in the mechanical properties
of the linker and accessibility of nucleosomal DNA for other proteins.


4. H4K16 deacetylation is a silencing mark in budding yeast 

In budding yeast, and specifically in budding yeast,
silencing is not achieved by histone methylation. Instead,
heterochromatin is induced by silent information
regulatory (SIR) complexes, which are recruited by
deacetylated nucleosomes. This mechanism crucially relies
on H4K16 (Dayarian and Sengupta, 2013) (see below
Sec. IV.C.1 for the detailed mechanism).


5. Histone tail methylation: indirect effects on chromatin
condensation

In animals, notably in drosophila and mammals,
silencing is mainly achieved through histone tail
methylation which, as mentioned above, does not directly
induce chromatin fiber compaction (a notable exception
was reported in (North et al., 2014)) but leads to the
recruitment of additional architectural proteins, typically
heterochromatin proteins.

Importantly, such architectural proteins are included
in the set of proteins that have been used to define the
chromatin colors in drosophila (Filion et al., 2010).
Precisely, chromatin colors are specific combinations of
epigenetic marks and associated proteins belonging to the
following set: histone-modifying enzymes, proteins that
bind specific histone modifications, general transcription
machinery components, nucleosome remodelers,
insulator proteins, heterochromatin proteins, structural
components of chromatin, and a selection of DNA binding
factors (Filion et al., 2010). Histone tail methylation
therefore seems to act as a (region-specific) substrate
to recruit (non-specific) proteins. In turn, these proteins
induce different chromatin-chromatin interactions in
different regions, and eventually different chromatin folding,
leading in particular to different degrees of compaction (see
Sec. III.C.1).

There are various kinds of heterochromatin in animals
(e.g. ‘black’, ‘blue’ and ‘green’ chromatin in drosophila;
even more “colors” in mammals). We focus here on the
physical mechanisms that drive the two main silencing
processes in animals, namely the recruitment and
spreading of HP1 (Heterochromatin Protein 1) by the H3K9me3
mark (Azzaz et al., 2014; Hathaway et al., 2012) and
the recruitment of the Polycomb architectural complex
(PcG) by the H3K27me3 mark (Tie et al., 2009). We
moreover discuss the role of these architectural proteins
in the physical process of heterochromatinization.

Unlike for acetylation, results obtained in vitro using
reconstituted chromatin arrays are not directly
transferable to in vivo contexts for at least two reasons: (i)
lysine methylation has no direct physical effect (recall
that, unlike lysine acetylation, lysine methylation does
not change electric charges); instead, lysine methylation
is recognized as a biochemical tag by dedicated chromatin
proteins, either architectural (Cosalia et al., 2014;
Mulligan et al., 2015; Ong and Corces, 2014; Zentner and
Henikoff, 2013) or remodeling proteins (Becker and Hörz,
2002); (ii) there is considerable cross-talk among histone
tail PTMs (Bannister and Kouzarides, 2011; Kouzarides,
2007; Li and Shogren-Knaak, 2008), which can then form
networks comparable to signaling pathways, eventually
resulting in a structural effect. An example of such a
pathway is given by (Wilkins et al., 2014) in the context
of budding yeast cell division, where phosphorylation of
serine 10 of the H3 tail induces H4K16 deacetylation,
which eventually leads to chromatin compaction.


a. HP1-mediated heterochromatin The proteins of the
Heterochromatin Protein 1 (HP1) family are fundamental
components of heterochromatin. They are abundant at the
centromeres and telomeres (which correspond roughly, as we
have seen, to central and ending regions of the
chromosomes, respectively) in nearly all eukaryotes.
They display high binding affinity for the H3K9me3
mark and are therefore specifically targeted to
nucleosomes harboring this mark. However, the spreading of
HP1 along an H3K9me3 epigenetic domain is still a
matter of debate. Thus, in the latest special issue of JPCM
(Everaers and Schiessel, 2015), devoted to the physics of
chromatin, two contrasting models have been proposed:
the group of Andrew Spakowitz (Mulligan et al., 2015)
claims that bridging interaction between HP1 dimers is
critical for HP1 spreading, at odds with the group of
Karsten Rippe (Teif et al., 2015), who claim that the
binding of one HP1 dimer can stabilize a stacked
nucleosome conformation and facilitate the binding of a
second dimer via an allosteric change of the nucleosome
substrate, with no need for a direct interaction between
neighboring HP1 dimers. It is to be noted that both
groups could reproduce the in vitro binding curves of
the yeast analog of HP1 (Swi6) on mono- and
dinucleosomes as well as on arrays of nucleosomes. Moreover,
Spakowitz’s group claims that HP1 bridging interaction
between different chromatin fibers explains the phase
separation of heterochromatin and euchromatin (Mulligan
et al., 2015), whereas Rippe’s group evidenced a
dependence of the binding stoichiometry on the NRL
(nucleosome repeat length) due to allosteric cooperativity of
binding for nucleosome arrays with long but not with
short DNA linkers, pointing to a facilitated spreading of
HP1 on long NRL substrates.


b. Polycomb-mediated heterochromatin Polycomb proteins
are a family of proteins that mediate transcriptional silencing
(Di Croce and Helin, 2013; Simon and Kingston, 2013).
In drosophila, it was found that two distinct regulatory
complexes (PRC1 and PRC2) are able to silence the Hox
genes in a stable and inheritable way (Beuchle et al.,
2001; Paro et al., 1998). It provides a mechanism for
“cellular memory” (Ringrose and Paro, 2004), that has
been speculated to be alternative to DNA methylation.

The precise mechanism underlying the heritability of
the repressed state of genes silenced by the Polycomb
complexes is still debated. It is known that the repressive
histone mark H3K27me3 (see Sec. III.B.5) is deposited
by the PRC2 complex. In turn, H3K27me3 recruits
PRC1, which then induces histone H2AK119
ubiquitination. However, recent studies showed that this
relationship may also work in the opposite sense (Blackledge
et al., 2014; Cooper et al., 2014). It has also been
suggested that in X chromosome inactivation (see
Sec. V.C), histone ubiquitination and Polycomb
proteins are mechanistically related to propagate the
silenced state (de Napoles et al., 2004).


A physical modelling of the cross-talk between histone
marks and the Polycomb complexes would be useful and
is, to the best of our knowledge, still missing.


C. How epigenetic marks organize the chromosomes in the
cell nucleus. General rules. Physical modeling of
epigenome wide studies.

1. Epigenome wide studies

One of the current paradigms in the field is that the
epigenetic landscape drives the 3D genome folding
and, by extension, the functional state of the cell. In order
to tackle this issue at the genome scale, epigenomic
techniques based on Next Generation Sequencing (NGS)
are increasingly used (Rivera and Ren, 2013). These
techniques are commonly used to map accessibility,
protein binding sites, and biochemical modification of
histones or DNA along the linear genome (e.g. in drosophila,
Fig. 7a). A new technique, genome-wide Chromosomal
Conformation Capture (Hi-C), has been developed in
order to address the issue of genome 3D folding using NGS.
This technique allows the generation of a list of
pairwise contacts between distal parts of the genome in
various organisms and cell types (e.g. in drosophila, Fig. 7b,
(Sexton et al., 2012)). A high contact probability is
characterized by a bright pixel in the Hi-C contact map (see
Fig. 7b, around the diagonal). The first results have
confirmed the physical segregation of the genome into
heterochromatin and euchromatin regions (Lieberman-Aiden
et al., 2009). An attractive model of chromosome folding
has been proposed in this seminal paper, named
“fractal globule”, which explains at the same time the
existence of chromosome territories, the absence of knots
in chromosomes, and the power-law decay (with
exponent close to $-1$) of the contact probability as a
function of genomic distance (Mirny, 2011). This long-lived
metastable state was introduced 20 years earlier on
theoretical grounds (Grosberg et al., 1988) to explain the
kinetics of collapse of homopolymers. At finer scales,
Hi-C further led to the identification of domains along the
genome in which contacts are numerous, whereas very few
contacts are established between different domains.
These regions are termed topologically associating
domains, TADs (Dixon et al., 2012; Nora et al., 2012).
TADs can be seen as high intensity blocks along the
diagonal of the chromosomal contact maps (Fig. 7b, cyan
squares). It was then realized that chromosomes are
actually block copolymers, each block corresponding to an
epigenetic domain (compare Figs. 7a and 7b). Combining
Hi-C results with the linear epigenomic annotations of
the genome (i.e. the biological information of the
underlying sequences) is in principle a powerful method to
comprehend the functional architecture of the genome.
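
The power-law decay of the contact probability with genomic distance mentioned above can be estimated from any square contact map by averaging along its diagonals. The following Python sketch is only an illustration under stated assumptions: the function names, the binning and the synthetic map are hypothetical and are not taken from any of the cited studies.

```python
import numpy as np

def contact_probability_by_distance(contact_map):
    """Mean contact frequency for each genomic separation s = |i - j| >= 1."""
    contact_map = np.asarray(contact_map, dtype=float)
    n = contact_map.shape[0]
    separations = np.arange(1, n)
    # The s-th diagonal collects all pairs of bins separated by s bins.
    mean_contacts = np.array(
        [np.diagonal(contact_map, offset=s).mean() for s in separations]
    )
    return separations, mean_contacts

def power_law_exponent(separations, mean_contacts):
    """Least-squares slope of log P(s) versus log s, using bins with P(s) > 0."""
    keep = mean_contacts > 0
    slope, _intercept = np.polyfit(
        np.log(separations[keep]), np.log(mean_contacts[keep]), 1
    )
    return slope

# Synthetic (hypothetical) map with P(s) = 1/s, i.e. an exponent of -1.
n_bins = 200
idx = np.arange(n_bins)
distance = np.abs(idx[:, None] - idx[None, :])
synthetic_map = np.zeros((n_bins, n_bins))
synthetic_map[distance > 0] = 1.0 / distance[distance > 0]

s, p = contact_probability_by_distance(synthetic_map)
exponent = power_law_exponent(s, p)
```

On real Hi-C data the fit is usually restricted to a range of separations, since the scaling is not expected to hold at all distances; that restriction is left out here for brevity.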


Several physical models have been developed so far in
order to understand the 3D folding properties of block
copolymers. The main goal of these studies is to recover
the chromosomal contact maps observed in Hi-C
data. Two main classes can be distinguished:
simulations that explicitly compute the 3D chromosome
conformations (Barbieri et al., 2012; Benedetti et al., 2014)
and implicit models in which average contact maps are
directly computed in a self-consistent Gaussian
approximation (Jost et al., 2014). The different explicit
models can account for the formation of TADs, either by
preferential binding of co-factors along specific regions
of the genome (Barbieri et al., 2012) or by topological
constraints (Benedetti et al., 2014), but so far the direct
comparison with experimental results has only been done
using the implicit approach (Jost et al., 2014). In this
study, the authors use the previously described colors of
chromatin in drosophila (Filion et al., 2010), which, as
discussed previously, assign to each subregion of the genome
an epigenetic “color”, based on the specific protein
binding and histone marks found in this region (Fig. 7a).
They then assign specific pair-potentials (see also (Saberi
et al.)) between beads of the same or different
colors (Fig. 7c) and compute the corresponding contact
maps using a statistical approach previously described
(Timoshenko et al., 1998). With well-chosen parameters,
they were able to retrieve the contacts found
experimentally (Fig. 7d). An important outcome of their study is
that a fixed epigenetic landscape is compatible with
several 3D conformations of the chain, a phenomenon which
they call multistable folding (see below “the physics of
TADs”). Note that these copolymer models, all at
thermal equilibrium, deviate drastically from the fractal
globule hypothesis.


2. The physics of TADs: finite-size effects in the coil-globule 
transition of copolymers 


Jost et al. (Jost et al., 2014) show a phase diagram of a
toy model copolymer as a function of the intensity of (i)
block-specific and (ii) non-specific interactions, that we
show in Fig. 8. On top of the coil-globule transition of
the whole copolymer, there is also a coil-globule transition
restricted to each separate block. Importantly, both coil
and globule phases coexist in a region of the phase
diagram, the size of which depends on the (average) size of
the blocks. This is consistent with the finite-size scaling
analysis of the coil-globule transition which has been
proposed in (Caré et al., 2014) (see also
arXiv:cond-mat/0004273).

Let us show that both transitions, namely the
coil-globule transition inside a given block and the
segregation of different blocks of the same color into separated
microphases, overlap in the phase diagram because of
finite-size effects.

We first recall that a polymer of $N$ monomers,
with monomer-monomer attractive interactions,
undergoes a coil-globule transition around the critical
temperature $\Theta(N) = \Theta\,(1 - b\sqrt{\ln(N)/N})$, where $b$ is a
dimensionless prefactor of order unity. More precisely, there
is an equilibrium between coil and globule conformations
over a temperature range between $\Theta(N) - a/\sqrt{\ln(N)}$ and
$\Theta(N) + a/\sqrt{\ln(N)}$, where $a$ is a dimensionless prefactor of
order unity. At $T = \Theta(N)$ both coil and globule
conformations are in equal proportions. Therefore, at a given
temperature $T$, longer polymers are more globular than
shorter polymers of the same kind.

FIG. 7 Modeling of chromosomal contact maps from the epigenomic landscape. (a) Profiles of H1 occupancy, DNA accessibility,
H3K27me3, H3K4me3, HP1 and a histone modifier, Su(Hw), along a region of the Drosophila chromosome 3R. At the bottom,
the colors corresponding to these profiles are shown. Yellow and red correspond to active chromatin, blue to Polycomb-bound
regions (Filion et al., 2010). (b) The corresponding contact map. (c) Schematics of the copolymer model used in (Jost et al.,
2014). (d) Two predicted contact maps corresponding to the region indicated by the pink dashed square in (b). ((a), (b) adapted
from (Sexton et al., 2012); (c), (d) adapted from (Jost et al., 2014).)

FIG. 8 Phase diagram of a toy model copolymer, as a function
of specific and non-specific interactions. From (Jost et al.,
2014).


We then consider a copolymer ABAB... made of small
blocks A and long blocks B. For example, the A blocks
could be HP1-like heterochromatin, and the B blocks
could be Polycomb-like heterochromatin. We consider
monomer-monomer attractive interactions represented
by an energy of interaction $E_{ij}$ between monomers with
epigenetic states $i$ and $j$ of the following kind:
$E_{ij} = U_{ns} + \delta_{ij} U_s$, where $U_{ns}$ is a non-specific term
(it does not depend on $i$ and $j$), $\delta_{ij}$ is the Kronecker
delta, and $U_s$ is a specific interaction term. According to the
preceding results on the coil-globule transition of finite-size
polymers, long blocks B go into globules when small blocks A
are still coils. When lowering the temperature (or
equivalently increasing the interactions), blocks B start to
transiently bind together into a macroglobule: this is now the
coil-globule transition of the whole copolymer, which is
equivalent to a chain of B globules separated by A
linkers; and while this chain collapses (folds), the A linkers
start to go into globules, so that both transitions overlap.
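
As an illustration of the interaction energy defined above, the Python sketch below builds the full matrix of pair energies for an ABAB block copolymer. The block lengths, the numerical values and the convention that attractive energies are negative are assumptions made for this example only; the function names are hypothetical.

```python
import numpy as np

def block_copolymer_states(block_lengths, block_labels):
    """Epigenetic state of every monomer, e.g. A = 0 and B = 1, block by block."""
    return np.repeat(block_labels, block_lengths)

def interaction_matrix(states, u_ns, u_s):
    """Pair energies E_ij = U_ns + delta_ij * U_s for all monomer pairs."""
    states = np.asarray(states)
    # Boolean matrix playing the role of the Kronecker delta on the states.
    same_state = states[:, None] == states[None, :]
    return u_ns + u_s * same_state

# Small A blocks (3 monomers) alternating with long B blocks (10 monomers).
states = block_copolymer_states([3, 10, 3, 10], [0, 1, 0, 1])
# Assumed sign convention: negative energies are attractive.
energies = interaction_matrix(states, u_ns=-0.1, u_s=-0.5)
```

Two monomers of the same color, even in distant blocks, interact with energy $U_{ns} + U_s$, whereas monomers of different colors only feel the non-specific term $U_{ns}$.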

Importantly, the macroglobule fluctuates between coil
and globule conformations (as does any B globule), so
that it transiently dissociates, thus permitting the small A
blocks, even in remote locations on the genome, to come
transiently into contact (see Fig. 7d). This corresponds
to the “multistate folding” region calculated by Jost et
al. and depicted in Fig. 8. Note that the width of this
multistate folding region varies as $1/\ln(n)$, where $n$ is the
typical size of the small(est) blocks.


Below the lower critical temperature $\Theta(N) - a/\sqrt{\ln(N)}$,
all the B globules are permanently collapsed in a
macroglobule with the A blocks located at the
macroglobule surface (because of interfacial tension). Crucially, the
A blocks are still coils, hence in the euchromatin phase,
so that their genomic sequence is expressed, whereas the
B blocks are globular and as such in the heterochromatin
phase, hence their underlying sequence is repressed (Caré
et al., 2015).


IV. PHYSICAL MECHANISMS INVOLVED IN THE 
INITIATION, SPREADING, MAINTENANCE AND 
HERITABILITY OF EPIGENETIC MARKS 


Stem cells are capable of differentiating to the desired 
fate depending on the tissue. Dramatic changes in gene 
expression occur during development. These changes are 
then stabilized and become heritable. Epigenetic
modifications take part in initiating, stabilizing and
propagating the patterns of gene expression. Gene regulation
by epigenetic modifications is indeed stably propagated
through cell divisions (and, in some cases, across
generations). At each cell division, the whole DNA is replicated.
Chromosomes then consist of two sister chromatids which 
both have identical genetic information, joined together 
at their centromere. Then, during mitosis, the two chro¬ 
matids are separated and segregated into the two nuclei 
of the daughter cells. 

Eukaryotic replication involves both DNA synthesis
and chromatin assembly. As the two double helices are
synthesized from the two single strands of the mother-cell
DNA, nucleosomes on the mother-cell DNA strand
should also be distributed to both daughter double
helices, and completed by de novo nucleosome assembly. In
order to ensure the transmission of epigenetic marks to
daughter cells, mother-cell nucleosomes should be shared
by both newly formed chromosomes, even if the
detailed mechanisms of this distribution are still debated
(MacAlpine and Almouzni, 2013).


While it is clear that histone modifications are involved 
in gene silencing, hence gene regulation, the questions of 
how epigenetic marking is initiated, how it may spread 
over specific chromosome regions (and not beyond), and 
how it can be stably maintained along the cell cycle and 
through the cell division are still under investigation. In 
this section, we will review the main modeling efforts that 
have been made in order to address these questions. 


A. Mathematical modelling 

Many recent theoretical works addressed the question
of how epigenetic marks are initiated, spread, and
maintained. The main objective of these models is to
reproduce a few essential features observed in vivo: (a) the
multistability of the epigenetic marks; (b) their spatial
patterns and (c) their heritability.

By multistability, it is generally meant that the
epigenetic marks act as switches between different functional
states. In the simplest case, different patterns of
epigenetic marks allow switching between two states that
have a well-defined functional characterization
(bistability). Such functional states are then inherited by the
daughter cells through mitosis, which is what we call
heritability. As observed in genome-wide studies, the
epigenetic patterns correspond to distinct epigenomic domains
that are separated by boundaries (see Sec. III.C).

We consider a system of $N$ nucleosomes that can be in
$n_s$ different states. In the simplest case, $n_s = 2$ and one
refers to “modified” or “unmodified” states, which can
be related to active or inactive genes.

The state of the system is described by the variables
$\{s_1, s_2, \ldots, s_N\}$, where $s_i$ is the state of nucleosome $i$. If
we define $n_j$ as the number of nucleosomes in the state
$j$, then one can write the conservation of the number of
nucleosomes as

$$\sum_{j=1}^{n_s} n_j = N. \qquad (1)$$


Many theoretical works use the silenced mating-type
locus of the fission yeast Schizosaccharomyces pombe
(reviewed in (Grewal and Elgin, 2002)) as a model system.
In this system, the region containing the two
mating-type regions is normally “silenced”, i.e. not expressed.
The expression of the mating-type genes may become
bistable in mutants, flipping between a silenced state
and an active state (Grewal and Klar, 1996; Thon and
Friis, 1997). Each state is stable and heritable; transitions
between them occur apparently stochastically. The S.
pombe HMT, HDAC and other proteins are necessary for
silencing, and all may bind H3K9me directly or
indirectly.

In the following, we review the models of this behavior
proposed so far.


B. Zero-dimensional models 

In zero-dimensional models, neither the spatial
organization of the $N$ nucleosomes nor the notion of distance
is introduced. In general, the model consists of rate
equations describing how the variables $n_j$ vary as a function
of time, and the objective of the models is to show how bistable
or multistable states can appear. In this class of
models, the initiation of the epigenetic mark is implicitly
defined as the initial state of the dynamical system, and the
spreading is described as the time evolution of the initial
state. Mitosis can be modeled as an instantaneous
process in which the concentrations of all species (modified
and unmodified nucleosomes) are diluted and the system
restarts. The dilution is due to the sharing of mother-cell
nucleosomes between both daughter cells. Nucleosomes are
not necessarily shared in equal parts between daughter
chromosomes, but equal sharing may be assumed without loss of
generality, as is done for convenience in most models.
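
The dilution step at mitosis can be sketched in a few lines of Python. This is a minimal illustration, assuming that each mother-cell nucleosome is inherited by a given daughter with probability 1/2 and that every position left empty is filled by a newly assembled, unmodified nucleosome; the function name and the state encoding (0 for unmodified) are hypothetical.

```python
import numpy as np

def divide(states, unmodified_state=0, rng=None):
    """Return the nucleosome states of one daughter cell after mitosis.

    Each mother nucleosome is kept with probability 1/2 (assumption);
    the others are replaced by new nucleosomes in the unmodified state.
    """
    rng = np.random.default_rng() if rng is None else rng
    states = np.asarray(states)
    inherited = rng.random(states.shape) < 0.5
    return np.where(inherited, states, unmodified_state)

# A fully modified mother cell (state 1 everywhere) with 10,000 nucleosomes.
rng = np.random.default_rng(0)
mother = np.ones(10_000, dtype=int)
daughter = divide(mother, rng=rng)
modified_fraction = daughter.mean()
```

With equal sharing, the fraction of modified nucleosomes is halved on average, which is the instantaneous dilution used in the zero-dimensional models.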

We can write a general expression for the time
evolution of the variables $n_j$:

$$\frac{dn_j}{dt} = \sum_{k \neq j}^{n_s} R^{+}_{kj} + \sum_{k \neq j}^{n_s} R^{-}_{jk} + \text{noise}. \qquad (2)$$

Here, $R^{+}_{kj}$ is the rate of transition of nucleosomes from the
state $k$ to the state $j$, while $R^{-}_{jk}$ is the rate of transition
from state $j$ to $k$ (obviously, $R^{+}_{jk} = -R^{-}_{jk}$). In general,
these coefficients are not constant, but depend on the
other dynamical variables. The “noise” term may be included
to describe the effect of stochastic processes involved in
the system.

The simplest possible model of this kind was proposed by
Micheelsen et al. (2010). The authors consider the case of
$n_s = 2$, that is, they consider only a modified (M) or
unmodified (U) state. Using equation (2), the system may be
described by only one variable $n_M$, the number of modified
nucleosomes. The transition rates are given by

$$R_{UM} = \alpha\, n_M^2 + (1 - \alpha), \qquad R_{MU} = \alpha\, (N - n_M)^2 + (1 - \alpha). \qquad (3)$$

This model supposes that the creation of a modified state
involves either a cooperative transition (as evidenced by the
quadratic terms in equation (3)) or a spontaneous conversion
of the unmodified state (described by the $(1 - \alpha)$
term). Despite its simplicity, the model can account for the
emergence of bistability. The parameter
$F = \alpha/(1 - \alpha)$ (feedback-to-noise ratio) governs
the behavior of the system. For $F > 4$, three fixed points
emerge in the system: $n_{M1} = 0$ and $n_{M3} = N$, which
are stable, and $n_{M2} = N/2$, which is unstable. The $F$
parameter is possibly under active control by the cell,
which can then regulate its function (notably by HDAC
inhibitors (Dayarian and Sengupta, 2013)). Heritability
can be partially accounted for by this model, since one
can speculate that cell division brings the system close
to the unstable point, from which it then returns to its
stable attractor.
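As an illustration of the bistability threshold, the following Python sketch evaluates the mean-field drift implied by rates of the form of equation (3). It assumes that the rates are written for the fraction m = n_M/N of modified nucleosomes, a normalization chosen here for convenience and not taken from the original paper; the function names are illustrative. Under this assumption the two outer fixed points lie at m = (1 ± sqrt(1 - 4/F))/2 and approach 0 and 1 when F is large.

```python
import math


def drift(m, alpha):
    """Mean-field dm/dt for the modified fraction m = n_M / N (assumed normalization)."""
    r_um = alpha * m**2 + (1.0 - alpha)          # U -> M: recruited + spontaneous
    r_mu = alpha * (1.0 - m)**2 + (1.0 - alpha)  # M -> U: recruited + spontaneous
    return (1.0 - m) * r_um - m * r_mu


def fixed_points(feedback_to_noise):
    """Fixed points of the drift for a given F = alpha / (1 - alpha)."""
    if feedback_to_noise <= 4.0:
        return [0.5]                             # only the symmetric point exists
    s = math.sqrt(1.0 - 4.0 / feedback_to_noise)
    return [(1.0 - s) / 2.0, 0.5, (1.0 + s) / 2.0]


def is_stable(m, alpha, h=1e-6):
    """A fixed point is stable when the drift has a negative slope there."""
    return drift(m + h, alpha) - drift(m - h, alpha) < 0.0


if __name__ == "__main__":
    F = 10.0
    alpha = F / (1.0 + F)
    for m in fixed_points(F):
        print(f"m = {m:.3f}  stable = {is_stable(m, alpha)}")
```

The drift factorizes as (2m - 1)[alpha m(1 - m) - (1 - alpha)], which is why the threshold F = 4 appears: the bracket can only vanish when 1/F does not exceed the maximum 1/4 of m(1 - m).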

David-Rus et al. thoroughly investigated a more general
model that still has $n_s = 2$ (David-Rus et al., 2009).
Their rates read:

$$R_{UM} = \chi + \alpha\, n_M^{H}, \qquad R_{MU} = \gamma + \eta\, n_U^{K}. \qquad (4)$$

The first interesting result they obtained is that this
model can reproduce bistability only for $H, K > 1$. The
simple quadratic case $H = K = 2$ is a generalisation of
the model of Micheelsen et al. (Micheelsen et al., 2010),
where the cooperative transition probability (rate) from
U to M is independent of that from M to U. If the
basal rates $\chi$ and $\gamma$ are small, one again obtains
three fixed points, with the intermediate unstable point being
$n_{M2} \simeq N \eta/(\alpha + \eta)$. Assuming now that cell
division exactly halves the concentration of modified
nucleosomes for each daughter cell, if $n_{M2} > n_{M3}/2$,
then the system will always fall in the basin of attraction
of $n_{M1}$ after a cell division, hence the only stable point
is the unmodified state $n_{M1} \simeq 0$. Conversely, for
$n_{M2} < n_{M3}/2$ (hence $\eta < \alpha$), the system will
converge to the modified-state fixed point $n_{M3} \simeq N$
for initial conditions larger than $2 n_{M2}$, and bistability
becomes effective.
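The effect of division on the two-state model can be checked with a short numerical sketch. The Python code below integrates the mean-field dynamics for the quadratic case H = K = 2, written for the modified fraction m = n_M/N (an assumed normalization), and halves m at division. The parameter values and function names are illustrative assumptions, not values from David-Rus et al.

```python
def drift(m, alpha, eta, chi=1e-3, gamma=1e-3):
    """Mean-field dm/dt for H = K = 2, with m = n_M / N (assumed normalization)."""
    r_um = chi + alpha * m**2            # U -> M: basal + recruited by M
    r_mu = gamma + eta * (1.0 - m)**2    # M -> U: basal + recruited by U
    return (1.0 - m) * r_um - m * r_mu


def relax(m, alpha, eta, t_end=200.0, dt=0.01):
    """Integrate the drift with explicit Euler steps up to t_end."""
    for _ in range(int(t_end / dt)):
        m += dt * drift(m, alpha, eta)
    return m


def modified_state_survives_division(alpha, eta):
    """Start fully modified, relax, halve m at division, then relax again."""
    m_before = relax(1.0, alpha, eta)
    m_after = relax(0.5 * m_before, alpha, eta)
    return m_after > 0.5


if __name__ == "__main__":
    print(modified_state_survives_division(alpha=2.0, eta=1.0))  # eta < alpha
    print(modified_state_survives_division(alpha=1.0, eta=2.0))  # eta > alpha
```

With small basal rates the unstable point sits near eta/(alpha + eta), so the halved state (close to 1/2) lies above it only when eta < alpha, in line with the condition stated in the text.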


This scenario is however modified by the presence of
noise in the system. In fact, if the probability of
transition from U to M is larger than the probability of
transition from M to U (that is, $\eta < \alpha$), then the
$n_{M1}$ fixed point is no longer stable. Noise drives the
system out of the $n_M = 0$ state, and brings it to the fully
modified $n_M \simeq N$ state. This consideration highlights
the importance of asymmetric recruitment rates.


The same authors also considered the case of $n_s = 3$,
which was already considered by Dodd et al. (Dodd et al.,
2007) in a very similar form. They consider the case of an
“antimodified” state (A), that is possibly an acetylated
state (active chromatin mark), opposed to the M state,
which is possibly a methylated (repressive) state
(see Fig. 10a). A hypothesis is that only the U ↔ M
and U ↔ A transitions are allowed, but the direct M ↔ A
transition is not (i.e. $R_{MA} = R_{AM} = 0$). They write
the following transition rates:

$$\begin{aligned} R_{UA} &= \alpha_A n_A + \chi_A, \\ R_{UM} &= \alpha_M n_M + \chi_M, \\ R_{AU} &= \beta_M n_M + \gamma_A, \\ R_{MU} &= \beta_A n_A + \gamma_M. \end{aligned} \qquad (5)$$


The study of the system in the case where the basal rates
$\chi_M$, $\gamma_M$, $\chi_A$ and $\gamma_A$ vanish already
shows the existence of four fixed points (here $n_A$ and
$n_M$ denote fractions of the total number of nucleosomes):
two stable fixed points, $\{n_A = 1, n_M = 0\}$ and
$\{n_A = 0, n_M = 1\}$, an unstable fixed point
$\{n_A = 0, n_M = 0\}$, and a saddle point
$\{n_A = \alpha_M \beta_M / D,\; n_M = \alpha_A \beta_A / D\}$,
with $D = \beta_A \beta_M + \alpha_M \beta_M + \alpha_A \beta_A$.
For symmetric parameters the two latter points are aligned
along the $n_A = n_M$ line and create a barrier between the
two basins of attraction (David-Rus et al., 2009). The phase
flow diagram of such a system is depicted in Fig. 9.
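The fixed-point structure of the three-state model can be checked numerically. The sketch below assumes that, with vanishing basal rates, the rates read R_UA = alpha_A n_A, R_UM = alpha_M n_M, R_AU = beta_M n_M and R_MU = beta_A n_A, with n_A and n_M taken as fractions; it then classifies each fixed point from a finite-difference Jacobian. The function names and the classification helper are illustrative and not part of the cited work.

```python
def flow(a, m, alpha_a, alpha_m, beta_a, beta_m):
    """(da/dt, dm/dt) for fractions a = n_A/N and m = n_M/N, zero basal rates."""
    u = 1.0 - a - m
    da = u * alpha_a * a - a * beta_m * m   # gain U->A (by A), loss A->U (by M)
    dm = u * alpha_m * m - m * beta_a * a   # gain U->M (by M), loss M->U (by A)
    return da, dm


def interior_fixed_point(alpha_a, alpha_m, beta_a, beta_m):
    """Fixed point with a, m and u all non-zero."""
    d = beta_a * beta_m + alpha_m * beta_m + alpha_a * beta_a
    return alpha_m * beta_m / d, alpha_a * beta_a / d


def classify(a, m, params, h=1e-6):
    """Classify a fixed point from the trace and determinant of the Jacobian."""
    fa_p, ga_p = flow(a + h, m, *params)
    fa_m, ga_m = flow(a - h, m, *params)
    fm_p, gm_p = flow(a, m + h, *params)
    fm_m, gm_m = flow(a, m - h, *params)
    j11 = (fa_p - fa_m) / (2 * h)   # d(da/dt)/da
    j21 = (ga_p - ga_m) / (2 * h)   # d(dm/dt)/da
    j12 = (fm_p - fm_m) / (2 * h)   # d(da/dt)/dm
    j22 = (gm_p - gm_m) / (2 * h)   # d(dm/dt)/dm
    trace = j11 + j22
    det = j11 * j22 - j12 * j21
    if det < 0:
        return "saddle"
    return "stable" if trace < 0 else "unstable"


if __name__ == "__main__":
    params = (5.0, 5.0, 3.0, 3.0)   # alpha_A, alpha_M, beta_A, beta_M as in Fig. 9
    points = [(1.0, 0.0), (0.0, 1.0), (0.0, 0.0), interior_fixed_point(*params)]
    for a, m in points:
        print(f"(n_A, n_M) = ({a:.3f}, {m:.3f}): {classify(a, m, params)}")
```

For the symmetric values quoted for Fig. 9 the interior point lies on the n_A = n_M line, at n_A = n_M = 5/13.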


The last model of this class that we consider is the one
proposed by Jost (Jost, 2014). The author considers a
FIG. 9 Phase flow portrait for the three-state model described
by David-Rus et al. (David-Rus et al., 2009), equations (5).
The values of the parameters here were
$\alpha_A = \alpha_M = 5$ and $\beta_A = \beta_M = 3$. One can
clearly observe the presence of the stable fixed points and
the unstable fixed point. From (David-Rus et al., 2009).


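special case of the three-state model outlined above:

$$\begin{aligned} R_{UA} &= \epsilon_A n_A + k_0, \\ R_{UM} &= \epsilon_M n_M + k_0, \\ R_{AU} &= \epsilon_M n_M + k_0, \\ R_{MU} &= \epsilon_A n_A + k_0, \end{aligned} \qquad (6)$$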


that is, it is the same model with
$\alpha_{A,M} = \beta_{A,M} = \epsilon_{A,M}$ and
$\chi_{A,M} = \gamma_{A,M} = k_0$. Interestingly, this
particular choice allows one to map the system onto the
zero-dimensional Ising model, with, e.g., the correspondence
A = +1, U = 0, M = −1. Within this analogy, recruitment
corresponds to coupling between spins, and random transitions
are associated with thermal fluctuations. A new observable,
equivalent to the magnetization in the Ising model, is
introduced here: $\mu = a - m$.

Some known results can thus be recalled for the symmetric
recruitment case $\epsilon_A = \epsilon_M = \epsilon$.
Similarly to what was previously discussed, three fixed
points exist. The first one, $\mu = 0$, is stable for weak
recruitment, i.e. for $\epsilon < \epsilon_c = 3 k_0$. Above
this critical value of $\epsilon$, $\mu = 0$ becomes unstable
and bistability sets in with the appearance of two stable
fixed points,
$\mu_\pm = \pm (k_0/\epsilon) \sqrt{(\epsilon/k_0 + 1)(\epsilon/k_0 - 3)}$.
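The critical recruitment strength and the two magnetized branches can be verified with a few lines of Python. The sketch assumes the mean-field reading of the rates above, with a = n_A/N and m = n_M/N, and uses illustrative function names; on the bistable branches the unmodified fraction equals k_0/epsilon, which is used to recover (a, m) from the magnetization.

```python
import math


def drift(a, m, eps_a, eps_m, k0):
    """Mean-field (da/dt, dm/dt) for fractions a = n_A/N and m = n_M/N."""
    u = 1.0 - a - m
    da = u * (eps_a * a + k0) - a * (eps_m * m + k0)
    dm = u * (eps_m * m + k0) - m * (eps_a * a + k0)
    return da, dm


def magnetization_fixed_points(eps, k0):
    """Fixed points of mu = a - m for symmetric recruitment eps_A = eps_M = eps."""
    x = eps / k0
    if x <= 3.0:
        return [0.0]                      # weak recruitment: only mu = 0
    mu = math.sqrt((x + 1.0) * (x - 3.0)) / x
    return [-mu, 0.0, mu]


def state_from_mu(mu, eps, k0):
    """Recover (a, m) on the magnetized branches, where u = k0 / eps."""
    s = 1.0 - k0 / eps                    # a + m
    return (s + mu) / 2.0, (s - mu) / 2.0


if __name__ == "__main__":
    eps, k0 = 5.0, 1.0
    for mu in magnetization_fixed_points(eps, k0):
        print(f"mu = {mu:+.4f}")
```

Note that `state_from_mu` only applies to the two non-zero branches; the mu = 0 fixed point has a different unmodified fraction.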


The non-local character of the nucleosome-nucleosome
interaction, which is the main hypothesis of the zero-order
models, has been further justified by a recent work
(Zhang et al., 2014). The authors proposed a two-layer
Potts model in which one layer describes the nucleosomes,
and the other explicitly includes the enzymes that modify
the nucleosomes. The interaction between the nucleosomes is
then effectively mediated by the modifying enzymes.
Interestingly, by integrating out the effect of the
modifiers, it is possible to prove the exact equivalence to
the model proposed by Dodd et al. (Dodd et al., 2007).


To conclude this section, we stress the main results of
this comparative analysis. Bistability is obtained in this
class of models in two ways: in two-state models only
when nonlinear rates are included, and in three-state models
even with linear rates. The reason for this is that
in three-state models the transition from a modified to
an antimodified state can proceed only in a two-step
process, effectively requiring cooperativity, hence producing
bistable states (Dodd et al., 2007).


C. Higher-dimensional models 

An inherent limit of the models discussed above is that
they cannot reproduce the spatial patterning of the
epigenetic marks. Hence, the spreading of the mark has to be
limited by hand, by fixing the extent of the domain
concerned, i.e. the total number of nucleosomes. While
this assumption may be relevant e.g. for the mating-type
loci in yeast, it probably fails when multicellular
organisms are considered. It is known, for example, that
nearly all noncentromeric H3K9me3 domains in mouse
embryonic stem cells have a peaked shape, with continuously
decaying mark densities on both sides (Hathaway et al.,
2012).


1. One-dimensional models. 

Even when mark spreading is limited by boundaries, the
question arises of how to model their presence and
effects. Dodd and Sneppen point out in their 2011 work
(Dodd and Sneppen, 2011) that positive feedback can
lead to spreading of the modifications to genome regions
other than the target. They refer in particular to the
silent mating-type loci in the budding yeast Saccharomyces
cerevisiae. In this organism, the MAT (mating type)
gene has two variants, MATa and MATα, and switching
of the mating type occurs when the expressed MAT
variant changes from one type to the other. This mechanism
is possible because each of the two variants also comes
with a silenced copy: HML (hidden MAT left) carries
a silenced MATα allele, and HMR (hidden MAT right)
bears a silenced MATa allele. HML and HMR are
able to spontaneously flip between high and low expression
states (Xu et al., 2006), thereby allowing for switching
of mating type. These domains are stable over up
to 80 cell generations, and are surrounded by boundary
elements that prevent silencing from spreading out of the
domains. These “barriers” are specific sequences, and may
simply be target sites for certain DNA-binding proteins,
strong gene promoters, or nucleosome-excluding structures.
Dodd and Sneppen therefore consider a model in
which all nucleosomes are explicitly treated, and the
long-distance interaction between nucleosomes is modeled in
a “local-local”, “local-global”, or “global-global” scheme
(see Fig. 10c). To limit the long-range interaction
between DNA sites one can introduce a distance-dependent
cooperativity, i.e. make the reaction rate $R_{UM}$
depend on the distance between nucleosomes. A power-law
dependence, typical of the three-dimensional probability of
contact, can be assumed.
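A distance-dependent recruitment rate of this kind can be sketched as follows. The exponent 1.5 (the ideal-chain contact exponent) and the prefactor `alpha` are assumptions made here for illustration; the text does not specify their values, and the function names are hypothetical.

```python
def contact_weight(distance, exponent=1.5):
    """Power-law contact probability between two nucleosomes (assumed exponent)."""
    if distance <= 0:
        raise ValueError("distance must be a positive number of nucleosomes")
    return distance ** (-exponent)


def recruited_rate_um(i, states, alpha=1.0, exponent=1.5):
    """Recruited U -> M rate at site i, summed over all M nucleosomes.

    `states` is a sequence of 'U', 'M' or 'A' labels along the fiber; each
    M nucleosome contributes with a weight that decays with its distance to i.
    """
    return alpha * sum(
        contact_weight(abs(i - j), exponent)
        for j, s in enumerate(states)
        if s == "M" and j != i
    )


if __name__ == "__main__":
    fiber = ["M", "U", "U", "U", "M"]
    for site in (1, 2):
        print(site, recruited_rate_um(site, fiber))
```

In this toy fiber the site adjacent to a modified nucleosome receives a larger recruited rate than the central site, which is equidistant from both modified ends.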

Then, the confinement of silenced regions can be obtained
by introducing local barriers, modeled as single
nucleosomes fixed in the active (A) state. Due to the local
character of the modification step, a single
silencing-resistant nucleosome (e.g. H3K4me3
(Venkatasubrahmanyam et al., 2007)) or a nucleosome-depleted
region (notably in gene promoters (Hi et al. 2004)) is enough
to stop the spreading of silencing, provided that the
flanking regions are entirely in the active state. However,
an occasional inactivation of the barrier lets the silencing
spread out. This effect can be limited by introducing
regularly spaced weak barriers, modeled as anti-silencers
(enhancers) of the U → A reaction, or by implementing
in the model a Michaelis-Menten saturation effect when
the number of U-state nucleosomes increases. The
combination of both effects results in robust prevention of
silencing spreading.

Focusing instead on mammalian silenced regions,
Hathaway et al. (2012) were able to reproduce the sharp peaks
observed in the experimental modification patterns by
including a “source” term in their model. This is a model
in which initiation and spreading are explicitly separated,
which in turn allows spatial patterning to be reproduced.
They write the rates as

$$U_0 \longrightarrow M_0, \qquad U_i + \{M_{i+1}\ \mathrm{or}\ M_{i-1}\} \longrightarrow M_i + \{M_{i+1}\ \mathrm{or}\ M_{i-1}\}, \qquad M_i \longrightarrow U_i. \qquad (7)$$


This description means that at site 0 there is an active
modification source, and that the modification then spreads
to the neighboring nucleosomes with rate $k_+$. Fitting
to experimental results leads to $k_+$ and $k_-$ rates both
of the order of 0.1-0.2 h$^{-1}$ (in agreement with different
experimental estimates of $k_-$). However, as pointed out
by the authors, this model fails to predict the bistable
nature of the system, and therefore cannot describe this
crucial feature.
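A minimal stochastic sketch of such a source-and-spreading model is given below. It is not the authors' implementation: it assumes discrete time steps, a source rate at site 0 equal to the spreading rate, spreading at rate k_plus whenever at least one nearest neighbor is modified, and turnover at rate k_minus. The rate values (0.15 per hour) are simply picked inside the range quoted above, and all names are illustrative.

```python
import random


def simulate_profile(n_side=20, k_plus=0.15, k_minus=0.15, k_source=0.15,
                     dt=0.1, n_steps=20000, seed=0):
    """Time-averaged occupancy of the M state along a 1D array of nucleosomes.

    The nucleation site sits at the center of an array of 2 * n_side + 1
    sites. Rates are per hour, dt is the time step in hours.
    """
    rng = random.Random(seed)
    size = 2 * n_side + 1
    center = n_side
    state = [0] * size            # 0 = unmodified (U), 1 = modified (M)
    occupancy = [0] * size
    for _ in range(n_steps):
        new_state = state[:]      # synchronous update from the old state
        for i in range(size):
            if state[i] == 1:
                if rng.random() < k_minus * dt:       # turnover M -> U
                    new_state[i] = 0
            else:
                left = state[i - 1] if i > 0 else 0
                right = state[i + 1] if i < size - 1 else 0
                rate = k_plus if (left or right) else 0.0   # neighbor spreading
                if i == center:
                    rate += k_source                  # nucleation at the source
                if rng.random() < rate * dt:
                    new_state[i] = 1
        state = new_state
        for i in range(size):
            occupancy[i] += state[i]
    return [count / n_steps for count in occupancy]


if __name__ == "__main__":
    profile = simulate_profile()
    center = len(profile) // 2
    for offset in (0, 2, 5, 10, 20):
        print(offset, round(profile[center + offset], 3))
```

With these assumed rates the mark density is highest at the nucleation site and decays with distance, which is the qualitative feature the source term is meant to capture.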


In Ref. (Hodges and Crabtree, 2012) a more detailed
study of the model is presented. The source term ensures
that the resulting mark distributions are peaked at
the nucleation site, as experimentally observed, provided
is large enough (> 0.2), with increased amplitude
and formation rate for increasing

Still referring to the mating-type loci in budding yeast,
Dayarian and Sengupta (2013) consider a four-state
model with site-dependent rate equations. The fourth
state they consider is a doubly acetylated state, which
would correspond to acetylation of two H4K16 sites.
Importantly, in this model, the modified state M is supposed
to be a state where nucleosomes are bound to silencing
(Sir) proteins, and therefore depends on their availability
(concentration). In its most general form, this is a
one-dimensional model that explicitly describes cooperative
transitions that involve any nucleosome pair. However,
it can be simplified into a zero-dimensional model when
considering uniform solutions, which again show
bistability and a characteristic bifurcation diagram.
Moreover, such a concentration-dependent model allows for
additional interesting effects, involving a fine balance
between the silencing of mating-type loci, which have a
definite extent, and of the telomeres, whose extent may vary
depending on the protein availability (Dayarian and
Sengupta, 2013). Interestingly, this model also allows for
the stable coexistence of a silenced and an active domain,
i.e. with an immobile domain boundary, whose position
depends on the balance between self-adjusting environmental
parameters such as the concentration of active proteins,
a mechanism that these authors explored extensively
(Dayarian and Sengupta, 2013; Mukhopadhyay and Sengupta,
2013; Sedighi and Sengupta, 2007).


2. Three-dimensional models. 


Erdel et al. (Erdel et al., 2013) addressed some more
specific questions about the establishment of epigenetic
domains: how chromatin-modifying enzymes are targeted to
or excluded from given chromatin regions, how exactly the
modification can propagate from one nucleosome to another,
and how this state is reestablished or maintained during
replication. The proposed model focuses on the permanent
binding of enzymes to a scaffold, either on chromatin itself
or on the nuclear membrane, which defines a limited
chromatin region that can interact with the enzyme by
short-range diffusion. The spatial distribution of the
enzyme may hence result in a spatially limited enzymatic
activity, and thus in the definition of epigenetic domains.
This first attempt to take the chromatin architecture into
account in a three-dimensional model is noteworthy, despite
the difficulty in estimating many of the geometrical and
physical parameters involved in the model, such as the
linear base-pair density along the chromatin fiber, the
fiber stiffness, or the local nucleosome density. Moreover,
the question of how the correct architecture is set up in
the initial enzyme binding and in the definition of the
functional chromatin domains remains open.


It has also been recently proposed that pericentromeric
heterochromatin spreads its silenced state with a
“nucleation and looping” mechanism (Müller-Ott et al., 2014).
Chromatin-bound SUV39H1/2 complexes would act as
nucleation sites and propagate a spatially confined
heterochromatin domain with elevated H3K9me3 modifications
via chromatin loops. It is therefore relevant to include
three-dimensional structure in the theoretical modeling of
the spreading of epigenetic marks.


D. Biological relevance of the models 

In this section we intend to examine the biological rel¬ 
evance of a few key points that emerged in the discussion 
of models of initiation and spreading of epigenetic mod¬ 
ifications. 


1. Waddington's epigenetic landscape revisited. 

First, let us return to the discussion on the Wadding- 
ton landscape that we started in the Introduction. The 
classical image in the original Waddington representation 


(Waddington 1957) of a marble rolling down a hill does 


rather suggest a fixed landscape, leading to erroneous 
interpretation when one goes beyond the metaphorical 
level (see Fig. [^. 

In the simplest model we discussed, the one by
Micheelsen et al. (Micheelsen et al., 2010), the authors
show that the model can be reformulated as a Fokker-Planck
equation for the 1D diffusion of a particle in an
effective potential $V(n_M)$ (see Fig. 10b). The latter
accounts at the same time for drift (external forces) and
noise events (with a term of the type $D/\mu$, with $D$ the
diffusion coefficient and $\mu$ the mobility). The
Waddington idea of an epigenetic landscape is translated in
(Micheelsen et al., 2010) into more modern terms, by
defining a physically consistent energy profile. Note
however that the mechanism invoked here is not an evolution
along the profile of Fig. 10b toward the minimum energy
states, since different values of the $F$ parameter
correspond to different system parameters, hence different
external constraints. In other words, the equivalent of an
epigenetic landscape corresponds here to a given
$F$ = constant section of the two-dimensional potential
surface of Fig. 10b. This allows in turn to suppose that
external constraints may be included in the parameter $F$,
which may vary as a function of metabolism (level of
activity) or of drugs targeting “writers” or “erasers”
(see Sec. III.B), notably HDAC inhibitors (Dayarian and
Sengupta, 2013), thus typically making the system switch
from bistable to monostable conditions. As discussed by
Jost (Jost, 2014), this may also represent a strategy to
gain in system sensitivity, hence plasticity, during
development. Note that
the switching mechanism between bistable and monostable
conditions can be interpreted as the result of an
active process bringing the system out of bistability and
favoring its switching to a different state.

FIG. 10 A modern view on epigenetic landscapes. (a) The basic mechanism of a three-state nucleosome modification model,
depicting the modified (M), unmodified (U) and antimodified (A) states. The transition between M and U is catalyzed by
histone methyltransferases (HMTs) and histone demethylases (HDMs), which depends on an antimodified histone; between U
and A the transition is catalyzed by histone acetylases (HATs) and histone deacetylases (HDACs), which depends on a modified
histone. From (Dodd et al., 2007). (b) The coarse-grained potential $V(m)$ as defined in the main text, as a function of the mean
fractional number of modified nucleosomes $m$, and the feedback-to-noise ratio $F$ (from (Micheelsen et al., 2010)). (c) Different
modes of coupling between histone modification states, as described in (Dodd and Sneppen, 2011). Unrecruited enzymes may
modify histones directly. Otherwise, recruited enzymes may operate in non-cooperative (global or local) or cooperative
(local-local, local-global, or global-global) modes. From (Dodd and Sneppen, 2011). (d-h) Illustration of the “epigenetic landscape”
as proposed by Jost (Jost, 2014). As a function of the control parameters $\epsilon_A$ and $\epsilon_M$, the system may undergo a
transition between an inactive, active or bistable state. As shown in (h), the system may also develop a hysteretic behavior.
(d-h) from (Jost, 2014).
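The idea of an F-dependent landscape for the two-state model can be illustrated with a deliberately simplified sketch. The code below is not the effective potential of Micheelsen et al., which also contains a noise contribution: it only integrates the deterministic mean-field drift (2m - 1)[alpha m(1 - m) - (1 - alpha)], assumed here for the modified fraction m = n_M/N, to obtain a drift-only potential V(m) with dV/dm = -drift. The function names are illustrative.

```python
import math


def drift_potential(m, feedback_to_noise):
    """Drift-only potential V(m), defined up to an additive constant.

    With q = m (1 - m), V = alpha q^2 / 2 - (1 - alpha) q satisfies
    dV/dm = -(2m - 1) [alpha q - (1 - alpha)].
    """
    alpha = feedback_to_noise / (1.0 + feedback_to_noise)
    q = m * (1.0 - m)
    return 0.5 * alpha * q**2 - (1.0 - alpha) * q


def barrier_height(feedback_to_noise):
    """V(1/2) minus V at the minima; zero when the landscape has a single well."""
    if feedback_to_noise <= 4.0:
        return 0.0
    alpha = feedback_to_noise / (1.0 + feedback_to_noise)
    return 0.5 * alpha * (0.25 - 1.0 / feedback_to_noise) ** 2


if __name__ == "__main__":
    for F in (2.0, 6.0, 10.0):
        m_min = 0.5 if F <= 4.0 else (1.0 - math.sqrt(1.0 - 4.0 / F)) / 2.0
        print(F, round(m_min, 3), barrier_height(F))
```

In this simplified picture each value of F selects one section of the surface: a single well centered at m = 1/2 for F below 4, and a double well with a barrier at m = 1/2 above it.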


We then stress that it is important to consider the
asymmetry of the modification rates. Taking the notation
of Ref. (Jost, 2014) (equations (6)), we notice
that if the recruitment of enzymes by modified and by
antimodified marks is different, the stability diagram and
the boundaries between the mono- and bistable regions
can be traced as a function of the two parameters
$\epsilon_A$ and $\epsilon_M$. Bistability is observed only
for strong recruitment ($\epsilon > \epsilon_c$) and small
asymmetry.

The epigenetic landscape may also be viewed as a complex,
multi-dimensional dynamical system in which different
cell identities correspond to different dynamical
attractors of the system. In one approach, such a landscape
is modelled as being shaped by gene regulatory networks.
In a recent study, it has been proposed that stem cell
differentiation may be viewed as the process of transition
between steady-state attractors of genes that induce
or repress cell pluripotency (Zhang and Wolynes, 2014).
Also, it has been suggested that cellular identities are
characterized by a mixture of several states, and that
external signals may drive the transition from one cell
state to another. By analyzing existing data sets,
Lang et al. (2014) have been able to provide direct evidence
for this, demonstrating that epigenetic landscapes are a
very powerful tool to understand cellular dynamics.


2. Hysteresis 


For even stronger recruitment, a typical hysteretic
behavior appears that may have important biological
consequences. One can indeed expect that, while for
differentiated, stable cells the recruitment parameters are
almost symmetric, modifications of the environment might
actively induce asymmetric recruitment. The increase
of one recruitment parameter can thus bring the system
along the metastable branch, then make it abruptly
switch to the alternative state, which will then remain
stable even when the recruitment parameters come back
to their initial values, thanks to the hysteretic shape of
the bifurcation curve. In Fig. 10h, starting for instance
from the low $m$ state and symmetric recruitment, one can
increase $\epsilon_M/k_0$ and switch to the upper, high $m$
branch, then come back to $\epsilon_M = \epsilon_A$ without
switching back (see also Fig. 10d-h).

Close to $\epsilon_A \simeq \epsilon_M \simeq \epsilon_c$ the
system becomes ultrasensitive to perturbations, and highly
unstable. This regime may be associated with diseases. A
pathological increase in the frequency of replication, for
instance, may result in an increase of the random transition
rate $k_0$, which in turn may bring the system close to the
critical point and induce epigenetic instability and
misregulation.


However, the existence of a critical region may also
represent an advantage. During development, the ability
to switch between two coherent states when applying a
weak asymmetric signal (the developmental signal) may
facilitate developmental transitions. Since the random
transition rate $k_0$ may be increased by shortening the cell
cycle, the system can be brought closer to the critical
region and the switch induced by the application of a
weak asymmetric signal during a finite period of time.

FIG. 11 Mechanism of vernalization. The expression of the floral
repressor gene, FLC, is repressed when plants are exposed to
cold and remains stably repressed on the return to warm
temperatures. Since this repression increases with the duration
of the exposure to cold, flowering is more abundant for longer
cold duration (from (Song et al., 2013b)).


E. Example: plant vernalization 


The 3-state model proposed by Dodd et al. (Dodd
et al., 2007) has been successfully adapted to the
description of vernalization, the mechanism allowing plants
to flower after a prolonged cold period.

Plants have the ability to measure the duration of
a cold season and to remember this prior cold exposure
in the spring. In Arabidopsis thaliana, an annual
plant, a prolonged cold exposure progressively triggers
the H3K27me3-mediated epigenetic silencing of Flowering
Locus C (FLC), a locus encoding proteins that
in turn act as flowering repressors. The accumulation of
histone epigenetic marks in the FLC locus keeps increasing
during the cold. This slow dynamics of vernalization,
taking place over weeks in the cold, generates a level
of stable silencing of FLC in the subsequent warm period
that depends quantitatively on the length of the prior cold.
Then, once FLC is switched off, the silencing persists
at the return of the warm season, and is mitotically
stable through the rest of development (often for many
months) (see Fig. 11) (Song et al., 2013b). This latter
feature is characteristic of annual plants, while in
perennial plants FLC is repressed only transiently.


Satake and Iwasa (Satake and Iwasa, 2012) show that
this behavior can be accounted for by means of the Dodd
3-state model (Dodd et al., 2007), provided that an
explicit dependence of the model parameters on temperature
is included. Explicitly, the transition rates are written
in this case as

$$\begin{aligned} R_{U \to A} &= \beta n_A + x, && (8) \\ R_{U \to M} &= u(T)\,(\beta n_M + x), && (9) \\ R_{A \to U} &= v(T)\,(\beta n_M + x), && (10) \\ R_{M \to U} &= \epsilon\,(\beta n_A + x), && (11) \end{aligned}$$


where $u(T)$ and $v(T)$ account for the temperature tuning
and take different values in warm conditions before
vernalization, in cold conditions during vernalization, and
in warm conditions after vernalization. Transition rates are
in fact under the control of a series of proteins (in
particular Vernalization Insensitive 3, VIN3) whose
expression is temperature dependent. The authors show that a
strong feedback, hence bistability, is necessary to
reproduce the experimental observations. Interestingly, when
the system evolution is simulated, the transition from the A
to the M state is observed at a random time during the cold,
for a given system containing $N$ nucleosomes (i.e. a given
cell). Different cells therefore switch to the repressed
state with different delays after the change from warm to
cold. However, the average over a cell population leads to a
typical behavior that can be reproduced, if the cell
population is large enough (Satake and Iwasa, 2012). The
duration of winter memory is also tuned by model parameters,
in particular by those accounting for the cell division
rate and the rapidity of deposition of epigenetic marks after
vernalization. Changes in these parameters may lead to
a much shorter memory (from more than one year
to a few days), potentially explaining the different
behavior observed in annual and perennial plants.

While the previous work addressed the question of
bistable behavior in vernalization, the question of the
establishment of epigenetic marks induced by cold is
discussed by Angel et al. (Angel et al., 2011), both
theoretically and by experiments in Arabidopsis thaliana.
These authors focus on the fact that, when plants are
subjected to cold, repression (H3K27me3, M state) only
concerns a small (1 kb) nucleation region inside FLC (8 kb),
close to the first exon (coding region) after the promoter.
While during the cold only the nucleation region is marked,
after the return to warm conditions the profile changes
rather little in the nucleation region but rises
quantitatively across the rest of the FLC locus according
to the length of the cold period. They ask whether the
small size of the nucleation region would be sufficient to
cause a quantitative switch in the epigenetic state of the
whole FLC locus after return to the warm. Experimental
results are shown to be compatible with a 3-state
zero-dimensional model, provided that two supplementary
ingredients are included: (i) a site-specific nucleation of
the silencing modification during cold, described as an
increased probability to switch to the M state for a
sub-ensemble of the nucleosomes, and (ii) a permanent bias
in the histone dynamics towards the M modification on return
to the warm. Within these assumptions, simulated
population-averaged levels of the M modification are found
to be approximately stable up to 30 days after the cold,
with a modification intensity which depends on the duration
of cold, in good agreement with experiments.


Together, these two studies show that relatively simple
models displaying strong bistability can be usefully
employed to model epigenetic mechanisms involved in real
systems such as, in the case discussed here, plants, even
if real systems typically include a few additional features
needed to respond specifically to the particular functional
task they are designed for.


V. TOWARD A MORE COMPLEX SCENARIO: DNA 
METHYLATION, ROLE OF RNAS, SUPERCOILING IN 
EPIGENETICS 


Up to now we have focused on histone PTMs and presented
them as a crucial element in the transmission of
epigenetic information. However, the global picture is more
complex. Among the additional epigenetic mechanisms,
some have been known for a long time, such as DNA
methylation (see Sec. V.A), while others have been evidenced
quite recently, such as chromosome coating with (long)
non-coding RNAs as in X inactivation (see Sec. V.C),
messenger (i.e. protein-coding) RNA silencing by interaction
with micro RNAs (see Sec. V.D), or the coupling between
epigenetics and supercoiling (see Sec. V.E). An exhaustive
description of the overall picture would represent a titanic
task, well beyond the aim of this introductory review.
Therefore we focus here on the main physical aspects of
these biologically relevant mechanisms, drawing on a few
concrete examples.


A. DNA methylation 


Historically, DNA methylation was the first epigenetic
mark to be recognized as a “stable, inheritable
chemical modification that alters gene expression and
does not modify the sequence” (see Sec. I). In fact, in
the early days of research on DNA methylation, it was found
that methylation states are propagated through mitosis
(Wigler et al., 1981).

DNA methylation is the addition of a methyl (-CH3)
group to the carbon atom in position 5 of the
cytosine base (5mC). Importantly, DNA methylation is
coupled to metabolism through SAM (see Fig. 12a).

The prevalence of DNA methylation in the genome
changes significantly among different organisms: it is
very high in vertebrates (where one refers to a “global”
methylation), very low in Drosophila, and absent in the
nematode worm C. elegans. In somatic cells, cytosine
methylation occurs predominantly at CpG dinucleotides,
although it has been detected in any sequence context
both in plants (Cokus et al., 2008) and humans (Lister
et al., 2009), where 70-80% of CpG dinucleotides are
methylated.

The patterns of DNA methylation in the genome
are established in early development, and then faithfully
propagated throughout successive cell divisions.
Crucially, tissue-specific genes are kept unmethylated,
whereas the others are heavily methylated. These processes
are catalyzed by DNA methyltransferases (DNMTs). It is
generally thought that the two methyltransferases
DNMT3A and DNMT3B are responsible for establishing the
methylation pattern during development (de novo
methylation), and DNMT1 propagates the methylation pattern
to daughter cells (maintenance methylation) (Bird, 2002),
which we illustrate in Fig. 13.

FIG. 12 Physical aspects of DNA methylation. (a) Conversion of cytosine to 5-methylcytosine occurs using SAM
(S-adenosylmethionine) as a methyl group donor. Spontaneous de-amination may convert 5mC to thymine, leaving the methyl
group in the major groove. (b) Structural similarities between 5mC and thymine in the DNA major groove. (c) Specific
patterning of H-bond donors, acceptors, methyl groups and non-polar hydrogens allows for “base readout” of the DNA sequence
without strand opening (see main text). (d) and (e): cyclization experiments on methylated and unmethylated
DNA show distinct elastic properties of the two species (figures taken from (Perez et al., 2012)). In (d), 2D electrophoresis
shows the different migrations of linear (L) and circular (C) DNA species (either covalently closed (c) or nicked open (o))
for nonmethylated and methylated oligomers of 21 bp, respectively. (e) Ratio of circular and linear species as a function of
fragment size.


The precise mechanism behind the establishment of the
initial methylation pattern during development is largely
unknown. It has been proposed (Khraiwesh et al., 2010)
that initiation of epigenetic silencing by DNA methylation
depends on the ratio of the miRNAs and their target
messenger RNAs, in a so-called “RNA-directed DNA
methylation”.

5-methylcytosine can convert to thymine by spontaneous
de-amination (see Fig. 12a), leading to a common
DNA mutation. The hydrophobic methyl group in the
DNA major groove gives a structural similarity between
thymine and 5mC (see Fig. 12b). It is important to
notice that this allows for the possibility of a “base
readout” in the major groove, as proposed by some authors
(Machado et al., 2015). We shall discuss the implications
of this later.


From the biological point of view, the role of DNA
methylation is not clearly understood. Early studies on
the role of DNA methylation highlighted its importance
in gene silencing (McGhee and Ginder, 1979; Razin and
Cedar, 1991), in X-chromosome inactivation (Graves, 1982;
Mohandas et al., 1981; Venolia et al., 1982) and
gene imprinting (Li et al., 1993; Razin and Cedar, 1994).
It was later established that when CpG-island promoters
are methylated, the gene is irreversibly silenced
(Jones, 2012). With the advent of technologies that
enable genome-wide screening of the methylation state
of DNA, it has become clear that gene silencing is not the
only role of DNA methylation (Jones, 2012; Lister et al.,
2009), and that its biological role is highly dependent on
the sequence context in which it is found. For example,
DNA methylation has been associated with active gene
bodies (Chodavarapu et al., 2010), quite the opposite of
its established role in gene silencing. Detailed analyses
of the differences in the methylation patterns in different
cell types revealed an even richer phenomenology (Lister
et al., 2009; Marchal and Miotto, 2015; Spruijt and
Vermeulen, 2014), suggesting that the role of DNA methy-

FIG. 13 Methylation transmission. After replication, only
hemi-methylated bases are converted into fully methylated
bases.


DNA methylation has also been implicated in a variety of human diseases (see
(Machado et al., 2015) for an extensive review), in particular in cancer, and has
therefore received enormous attention. A challenging issue is the relation of DNA
methylation to cancer progression and the prediction of pre-cancerous cell states
(Feinberg and Tycko, 2004; Timp and Feinberg, 2013).


Can the change in the physical properties of DNA upon cytosine methylation help us
understand the variety of its roles? To address this question, we shall discuss
several aspects of the physics involved in cytosine methylation. In the following,
we review the available knowledge on: (a) the change in the elastic/mechanical
properties of DNA upon cytosine methylation; (b) the role of the hydrophobic methyl
group in determining DNA-protein interactions; (c) the relationship between DNA
methylation and chromatin structure in vivo.


1. Mechanical properties of DNA change upon methylation 


It has recently been established that the mechanical properties of DNA change when
cytosine is methylated. The extent of this change is still unclear, though.

A combination of all-atom molecular dynamics simulations and cyclization experiments
revealed that a single cytosine methylation at a CpG dinucleotide step has a
significant impact on the mechanical properties of DNA (Perez et al., 2012).
Cyclization experiments showed that oligomers stiffen significantly upon methylation
(see Fig. 12e). Moreover, the value of the base pair roll was found to increase,
whereas the twist and the width of the minor groove decreased. This should lead to a
bending of the base pair towards the DNA major groove, and a stiffening of the
sequence. However, when a poly-dinucleotide of type d(CpG)n was methylated, no
significant difference was found compared to its unmethylated counterpart. This is
due to the fact that the GpC dinucleotide has a compensating effect on the change of
mechanical properties of the CpG step.

The stiffening of DNA upon methylation was also predicted by another theoretical
study that employed van der Waals density functional theory (Yusufaly et al., 2013).
There, it was shown that "Methylation of CG-rich stretches of DNA enhances the
formation of the A-DNA polymorph, a helical form that is more resistant to bending
deformations than B-DNA, and which also bends DNA in the opposite sense.
Consequently, interactions with the histones are inhibited, and nucleosome formation
is suppressed." Moreover, by combining single-molecule cyclization experiments with
all-atom molecular dynamics simulations, Ngo et al. (2016) showed that 5mC increases
the local rigidity of DNA, and also reached the conclusion that this destabilizes
nucleosomes.

In mixed-sequence DNA, no significant effect of DNA methylation was observed by
cyclization of 158-180 bp fragments (Hodges-Garcia and Hagerman, 1995). However,
when combining detailed Monte Carlo simulations with cyclization experiments of an
EcoRI restriction site, Nathan and Crothers (2002) found that methylated sequences
change the flexibility and the twist rate of DNA. The emerging picture is that
cytosine methylation changes local structural parameters of base pair sequences,
while leaving the global elastic and mechanical properties of DNA unperturbed.

Another consequence of cytosine methylation is altered resistance to strand
separation (Severin et al., 2011). By combining single-molecule force experiments
with all-atom molecular dynamics simulations, the authors showed that strand
separation is strongly affected by cytosine methylation, which inhibits or
facilitates it depending on the sequence context. Again, the sequence context plays
an essential role in determining the direction and extent of the impact of DNA
methylation.


2. Impact of cytosine methylation on DNA-protein interactions

The addition of a methyl group to cytosine, in the major groove, alters the
hydrophobicity of DNA at the base pair step. In crystallographic studies, it was
found that a methylated A-form DNA oligomer (Mayer-Jung et al., 1998) is well
hydrated, thereby allowing for the possibility of specific recognition of methylated
DNA sequences through the interaction with the tightly bound water at the methyl
group.

Three classes of proteins that specifically bind to methylated DNA are known: MBD
(methyl binding domain) proteins, SRA (SET and RING associated) domain proteins, and
zinc-finger proteins (Buck-Koehntop and Defossez, 2013). Some crystal structures of
proteins that specifically recognize methylated DNA have recently become available
(Buck-Koehntop et al., 2012; Liu et al., 2014). The detailed analysis of the binding
modes revealed that although similarities between them exist (Liu et al., 2013), it
is as yet unclear why three distinct families of methylated DNA-binding proteins were
needed in the course of evolution. The sequence flanking the methylated CpG step was
shown to be important in determining the specificity of the interaction.

The structural differences between methylated and unmethylated DNA may in part
explain the specificity of interactions between proteins and DNA. As shown in
Fig. 12, the DNA base pair sequence determines a specific pattern of chemical groups
(hydrophobic, hydrogen bond donor or acceptor) in the major groove, but not in the
minor groove. As a consequence, one proposed mechanism of recognition of methylated
states involves a "base readout" in the major groove, which does not require strand
separation (Sasai et al., 2010; Zou et al., 2011).

Yet another consequence of DNA methylation is the [...] roll and minor groove width,
Lazarovici et al. (2013) found that roll and minor groove width were excellent
predictors of sequence specificity for the DNase I endonuclease. This mechanism of
recognition of methylated sequences is termed "shape readout". It is likely that
[...] Machado et al., 2015).


3. Relationship between nucleosome positioning and DNA methylation

As discussed earlier, it appears that CpG methylation locally stiffens the DNA, and
it was speculated that, as a consequence, nucleosome positioning would be disfavored.
However, contradictory results exist, some of which suggest that methylation actually
favors nucleosome formation, or stabilizes DNA wrapped around a nucleosome. We
discuss here the available data.

Several lines of evidence suggest that methylation affects the structure of DNA
wrapped around a histone core. Using fluorescence resonance energy transfer (FRET)
to measure the relative displacement between two sites on nucleosomal DNA, Lee and
Lee (2011) showed that methylation leads to tightening of DNA around the nucleosome.
Along the same lines, Choy et al. (2010) suggest that the enhanced rigidity of
methylated DNA leads to more compact and closed nucleosomes. Methylated DNA was also
shown to be more difficult to remove from a nucleosome (Kaur et al., 2012). By
combining data from genome-wide nucleosome positioning with available DNA methylation
maps, Chodavarapu et al. (2010) have shown that there is a small increase (about 1.2
percentage points, from 75 to 76.2%) in the preference of nucleosome positioning for
methylated DNA. These data have been contested by Felle et al. (2011), who instead
showed that the nucleosome occupancy of methylated DNA was 2-fold lower compared to
unmethylated sequences.

Other studies showed that the increased rigidity of methylated DNA disfavors
nucleosome positioning. We already discussed the works of Yusufaly et al. (2013) and
Ngo et al. (2016), but many other studies reached the same conclusions (Felle et al.,
2011; Perez et al., 2012; Portella et al., 2013).

Even more recent studies have shown that methylation has a negligible influence on
nucleosome stability (Langecker et al., 2015).


4. Remarks and perspectives

It has been discovered that 5-methylcytosine is not the only cytosine variant. In
2009, 5-hydroxymethylcytosine (5hmC) was discovered in mouse brains (Kriaucionis and
Heintz, 2009), and since then two other modified forms of cytosine were discovered:
5-formylcytosine (5fC) and 5-carboxylcytosine (5caC) (see Spruijt and Vermeulen,
2014). [...] has been detected in mouse embryonic stem cells in significant amounts
(Song et al., 2013a), so that careful investigation of the differences between their
physical properties is needed, which has only very recently begun (Ngo et al., 2016).

Traditional sequencing methods for the detection of cytosine methylation are not able
to distinguish between 5mC, 5hmC, 5fC and 5caC. Accurate methods to identify the
genome-wide map of cytosine methylation states have only very recently become
available (Booth et al., 2014). Much study is needed to understand the roles of these
epigenetic marks.

We speculate that distinctive physical properties of these alternative forms are in
part responsible for determining the variety of roles attributed to DNA methylation
(Schübeler, 2015; Spruijt and Vermeulen, 2014). We suggest that physics may help
advance our understanding of the biological aspects of this important epigenetic
mark. A very recent study has also shown, by all-atom molecular dynamics and
single-molecule fluorescence, that DNA methylation is able to promote attraction
between DNA fragments mediated by polyvalent cations (Yoo et al., 2016). This study
suggests yet another fascinating role for physics in determining the large-scale
organization of DNA in cell nuclei.

Finally, there is already a vast literature on the coupling between DNA methylation
and histone post-translational modifications (see Sec. III.B), which was reviewed in
(Cedar and Bergman, 2009). However, to our knowledge there has not yet been an
attempt to model this coupling from a physical point of view.


B. Parental imprinting 

In animals, it has been observed that certain genes are expressed in a
parent-of-origin-specific manner. These so-called "imprinted" genes are DNA
methylated on specific sequences named Imprinting-Control Regions (ICRs). The ICR
methylation of any imprinted gene occurs either on the paternally inherited or on the
maternally inherited chromosome this gene belongs to. Remember that there is one copy
of each gene on the paternal and on the maternal chromosome (except for sex
chromosomes). Both copies are generally different and are called paternal and
maternal alleles. ICR methylation starts in germ cells, keeps the memory of the
parental origin of the allele and drives monoallelic expression. For example, this
mechanism operates at the Igf2/H19 gene locus of mouse chromosome 7 (see Lesne
et al., 2015). The ICR located upstream of the H19 gene is methylated on the paternal
allele but remains unmethylated on the maternal allele. The maternal unmethylated
allele is bound by the transcriptional repressor CTCF (CCCTC-binding factor, CCCTC
being a DNA sequence), which prevents the interaction between regulatory sequences
(enhancers) located downstream of H19 and the Igf2 gene located further upstream of
the ICR. Therefore, the Igf2 gene is not activated on the maternal chromosome 7.
Instead, on the paternal chromosome, DNA methylation prevents the binding of the CTCF
protein and the Igf2 gene can be activated by the regulatory sequences. This
differential folding has been evidenced by Chromosome Conformation Capture (3C)
experiments during mouse development (Court et al., 2011).

Genomic imprinting was selected during evolution at the transition between placental
(e.g. mouse) and marsupial (e.g. kangaroo) mammals. Six genes in marsupials and about
a hundred in placental mammals undergo parental genomic imprinting. The reason for
the selection of this unusual, epigenetic mechanism of gene regulation during
mammalian evolution remains poorly understood at present.


C. Chromosome X inactivation 

Another important and historically relevant example of epigenetic silencing is the
inactivation of the X chromosome in female mammals.

The pair of sex chromosomes (XY in males, XX in females) is responsible for sex
determination in mammals. While the Y chromosome is small and carries only a few
genes, the X chromosome is much longer and contains thousands of genes. Females thus
carry twice as many copies of X-linked genes as males, which leads to a potentially
lethal dosage problem.

During early embryonic development, one of the two X chromosomes of females is thus
inactivated, and condensed to form the so-called Barr body (Barr and Bertram, 1949).


In mice, the inactivation of the X chromosome occurs in three phases. First, the
paternal X chromosome is inactivated during the preimplantation period, from the
two-cell stage onward. Then, it remains inactive in the peripheral cells, which will
form the placenta, but it is reactivated in the cells which will form the future
embryo. Finally, a second inactivation takes place, and this time it concerns,
randomly in each cell, either the paternal or the maternal X (Okamoto et al., 2004).


Intriguing questions then arise: how does the cell manage to silence only one of the
two X chromosomes? How is the silenced X chosen?

The X inactivation mechanism seems to be controlled by a complex genetic locus called
the X-chromosome-inactivation centre (Xic). It has been proven that the initiating
step of X inactivation is mediated by the non-coding (not translated into protein)
transcript of the Xist (X-inactive specific transcript) gene, located within the Xic.
Once transcribed, many copies of Xist RNA accumulate along the X chromosome (RNA
coating), then induce its heterochromatinization.

This process is, however, under the control of a few other genes included in the Xic
region. One of the crucial elements is Tsix, a non-coding RNA gene that is antisense
to the Xist gene (it is transcribed from the complementary DNA strand). Due to this
complementarity, the Tsix transcript forms a duplex with the complementary Xist
transcript, yielding a double-stranded RNA which is subsequently degraded. This
mechanism prevents the accumulation of Xist, and hence inhibits X inactivation
(Okamoto et al., 2004).


Random selection of the inactivated X chromosome may therefore emerge from a delicate
balance in the synthesis of Tsix and Xist. Recently, a model to explain this complex
regulation path has been proposed (Giorgetti et al., 2014), which relies on the
polymer physics properties of chromatin, on its organization in
topologically-associating domains (TADs, see Sec. III.C.2), and on a detailed
coupling between gene expression and 3D organization at the level of the Xic.

The Xic is composed of two topologically-associating domains (Nora et al., 2012),
called the Tsix TAD (320 kb) and the Xist TAD (600 kb). The 3D structure of the Tsix
TAD is highly variable among cells, and this variability is most probably due to
fluctuations of chromatin conformation at time scales shorter than a cell cycle, as
in the previously discussed model proposed by Jost et al. (2014). The distribution of
conformations observed in 5C experiments (Chromosome Conformation Capture Carbon
Copy), illustrated by Fig. 5A in (Giorgetti et al., 2014), indeed shows an
equilibrium between coil and globule conformations, typical of a coil-globule
transition of a polymer with finite-size effects (Care et al., 2014; Imbert et al.,
1997).

The model also assumes that the level of Tsix depends on the activity of two putative
regulatory elements (Linx and Chic1; Nora et al., 2012), placed inside the same Tsix
TAD.

Inactivation can then be explained as a result of the chromatin conformation of the
Tsix TAD, which determines the regulation of specific interactions between all these
elements (regulatory elements, Tsix and Xist). The switch between globule and coil
conformations indeed changes the spatial proximity between these genes, and hence
their interactions. As a result, globule conformations induce higher Tsix
transcription levels, while coil conformations correspond to lower Tsix levels.

Statistical fluctuations in chromatin conformation within the Tsix TAD may, then,
contribute to ensuring asymmetric expression from the Xic at the onset of X
chromosome inactivation, as shown by simulations (Giorgetti et al., 2014). If, in a
cell, the Tsix TAD is similarly compacted on the two alleles, Tsix transcript levels
from the two alleles are similar, with little or no heterochromatinization effect.
When a fluctuation induces the coil-globule transition of the Tsix TAD on one allele,
the two transcripts tend to be differentially expressed. This mechanism may help
ensure that Xist is only transcribed from the allele with lower Tsix transcription
(Fig. 6B in (Giorgetti et al., 2014)).
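The symmetry-breaking argument above can be illustrated by a deliberately crude two-state caricature. It is not the polymer model of Giorgetti et al.: it assumes that the Tsix TAD of each allele is independently either a globule (high Tsix) or a coil (low Tsix), with a hypothetical globule probability `p_globule`, and it simply counts the cells in which the two alleles differ.

```python
import random


def sample_cell(p_globule, rng):
    """Draw the Tsix-TAD state of the two alleles of one cell, independently."""
    return tuple(
        "globule" if rng.random() < p_globule else "coil" for _ in range(2)
    )


def xist_allele(states):
    """Index of the allele expected to express Xist, or None if symmetric.

    Toy rule taken from the text: globule -> higher Tsix, coil -> lower Tsix,
    and Xist is transcribed from the allele with the lower Tsix level.
    """
    if states[0] == states[1]:
        return None
    return states.index("coil")


rng = random.Random(0)   # fixed seed, for reproducibility only
p = 0.5                  # hypothetical probability of the globule state
cells = [sample_cell(p, rng) for _ in range(100_000)]
asym = sum(xist_allele(c) is not None for c in cells) / len(cells)
print(f"asymmetric cells: {asym:.3f} (independent alleles: 2p(1-p) = {2 * p * (1 - p):.3f})")
```

With independent alleles the asymmetric fraction is 2p(1-p), which is largest when coil and globule are equally likely. This is one way to see why a TAD poised near its coil-globule transition is well suited to break the symmetry between the two alleles.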

Once established, X inactivation is stably transmitted through mitosis throughout
subsequent development. Further (and later) features of the inactive X include
hypermethylation of DNA, histone deacetylation and chromatin condensation, i.e. the
same general mechanisms that we have previously introduced.


D. Non-coding RNA and microRNA 

X inactivation is a clear example of the crucial and early role of RNA in epigenetic
silencing. Together with histone and DNA modifications, non-coding RNAs have (more
recently) emerged as one of the main epigenetic mechanisms. Two main classes of
epigenetically active non-coding RNAs can be identified: small (<30 nucleotides) and
long (>200 nucleotides). Both classes play a role in heterochromatin formation,
histone modification, DNA methylation targeting, and gene silencing.

Long RNAs can form complexes with chromatin-modifying proteins and recruit their
catalytic activity to specific sites in the genome, thereby modifying chromatin
states and influencing gene expression. In the case of X inactivation, the first
described epigenetic mechanism involving a long non-coding RNA (lncRNA), it has been
shown that Xist RNA recruits chromatin-modifying factors such as the Polycomb
repressive complex PRC2, which mediates histone H3 lysine 27 methylation, but the
direct character of this interaction remains to be confirmed (Brockdorff, 2013), as
does the overall mechanism, including the interplay between the different RNAs
involved (Xist, Tsix and others) (Pontier and Gribnau, 2011). According to a proposed
model (Lee, 2012), long non-coding RNAs may function by sequestering
chromatin-modifying enzymes away from other interacting partners or by guiding
chromatin modifiers to the correct locations in the genome. In other cases, the long
RNA seems to work by binding and bringing together different types of proteins that
can cooperate in establishing the repressive chromatin state (see (Marchese and
Huarte, 2014) for a review on the interplay between lncRNAs and chromatin modifiers
in epigenetics). Very recently, single-molecule, single-cell RNA fluorescence in situ
hybridization (FISH) has made it possible to quantify and categorize the subcellular
localization patterns of a representative set of 61 lncRNAs in three different cell
types, giving further quantitative insight into the link between their putative role
and their position in the cell (Cabili et al., 2015). LncRNAs exhibit a diversity of
localization patterns, including a dispersed distribution in the nucleus or in the
cytoplasm, and condensation in nuclear foci, a configuration which may be consistent
with a role for these lncRNAs in chromatin regulation, as shown for XIST and other
lncRNAs involved in imprinting. Most lncRNAs nevertheless show stronger nuclear
localization than most messenger RNAs. Interestingly, lncRNAs do not persist at
nuclear foci during mitosis, suggesting that retention at specific regulatory regions
through mitosis is likely not a mechanism of mitotic inheritance.

A whole new realm of small non-coding RNAs was discovered in the late 1990s. It
includes two classes of small RNAs, microRNAs (miRNAs) and small interfering RNAs
(siRNAs), which perform many functions and in particular are involved in so-called
RNA interference, a pathway regulating gene expression at the transcriptional and
post-transcriptional levels. In other words, RNA interference may act either by
inhibiting the transcription of the target RNA or by degrading the transcript.
Piwi-interacting RNAs (piRNAs) represent a third, large class of slightly longer
RNAs, playing a role in epigenetic and post-transcriptional gene silencing of
retrotransposons and other genetic elements in germ line cells through their
association with Piwi proteins. Again, we note here that unnecessary and potentially
deleterious genes are often under epigenetic control (Cowley and Oakey, 2010).

At the post-transcriptional level, the essential repression mechanism for both miRNAs
and siRNAs is pairing to a complementary sequence of the messenger RNA transcribed
from a target gene, which results in the degradation of the RNA and thus the
repression of the gene. To recognize their target messenger RNA, miRNAs and siRNAs
must be associated with a protein to form the RNA-induced silencing complex.

Interestingly, heterochromatinization may also be initiated by the RNA interference
machinery*, which targets repetitive DNA sequences (Grewal and Jia, 2007). These are
DNA sequences of up to several million base pairs that consist of a large number of
repetitions of a much smaller sequence, and are found, in particular, at centromeres,
telomeres, and other regions that remain condensed throughout the cell cycle,
referred to as constitutive heterochromatin. At the same time, heterochromatin
mediates the spreading of the RNA interference machinery to surrounding sequences,
and hence the production of siRNAs, which in turn are essential for the stable
maintenance of heterochromatic structures.

A complex scenario thus emerges in which DNA sequences, RNAs, epigenetic factors and
chromatin remodellers play together in the setting up and maintenance of different
functional chromatin states. DNA methylation and histone modifications often act
together to regulate miRNA expression, while, conversely, some miRNAs can regulate
the expression of the epigenetic machinery, with important dysregulation effects in
cancer (Wang et al., 2013).


The models previously described for the spreading of epigenetic marks probably need
to include these additional features in order to reproduce the epigenetic mechanisms
on a larger scale. A first step in this direction may be the study of the interplay
between miRNAs and epigenetic regulators, and the particular role of
post-transcriptional regulation, as discussed by Osella et al. (2014). A typical
basic regulatory network involving miRNAs and epigenetic regulators is the double
negative feedback loop, in which a miRNA represses an epigenetic regulator, which in
turn represses the expression of the same miRNA. Starting from an approach similar to
that discussed in Sec. IV, it is possible to describe the system. More precisely, the
model variables are the number of miRNAs, the number of messenger RNAs that the
miRNAs repress, and the number of proteins (the epigenetic regulator) resulting from
the translation of the messenger RNA (and repressing, in turn, the miRNAs). The
interesting input from the miRNA regulation step is its role in keeping fluctuations
of gene expression under control, both by suppressing translation and by promoting
RNA degradation. Both effects indeed help to reduce the burstiness of protein
production (Friedman et al., 2006; Osella et al., 2014). The resulting set of rate
equations therefore includes, on the one hand, the highly non-linear and bistable
character of epigenetic regulation and, on the other, the stabilizing effect of miRNA
regulation, leading to an increased stability of the system and to an increased range
of bistability of the switch.

* The RNA interference machinery includes different factors (such as Dicer, Argonaute
and RNA-dependent RNA polymerases) that produce the small RNAs or bind them to form
functional complexes.

This result suggests possible reasons for the existence of regulatory pathways
combining epigenetic regulators and miRNAs, although both experimental investigations
and modeling of such complicated circuits are still at an embryonic stage.
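To make the double negative feedback loop discussed in this section more tangible, here is a minimal deterministic sketch with the three variables named in the text: miRNA (m), messenger RNA (r) and regulator protein (e). It is not the model of Osella et al.: the Hill-type repression of the miRNA by the regulator, the mass-action term for miRNA-induced mRNA degradation, and every parameter value in `PARAMS` are assumptions chosen only so that the loop is bistable. Being deterministic, it says nothing about noise buffering.

```python
# Illustrative (assumed) parameters: production rates k_*, decay rates g_*,
# miRNA-mRNA degradation constant c, Hill threshold K and Hill exponent n.
PARAMS = {
    "k_m": 10.0, "g_m": 1.0,
    "k_r": 25.0, "g_r": 1.0,
    "k_e": 1.0, "g_e": 1.0,
    "c": 10.0, "K": 1.0, "n": 2,
}


def derivatives(state, p):
    """Rate equations of the toy miRNA / mRNA / regulator loop."""
    m, r, e = state
    # The regulator represses miRNA production (Hill function, assumed form).
    dm = p["k_m"] / (1.0 + (e / p["K"]) ** p["n"]) - p["g_m"] * m
    # The miRNA promotes degradation of its target mRNA (mass action, assumed).
    dr = p["k_r"] - (p["g_r"] + p["c"] * m) * r
    # The regulator protein is translated from the mRNA.
    de = p["k_e"] * r - p["g_e"] * e
    return dm, dr, de


def integrate(state, p, dt=1e-3, steps=50_000):
    """Explicit Euler integration; adequate here because dt is small."""
    for _ in range(steps):
        rates = derivatives(state, p)
        state = tuple(x + dt * v for x, v in zip(state, rates))
    return state


mirna_rich = integrate((10.0, 0.0, 0.0), PARAMS)        # start with miRNA only
regulator_rich = integrate((0.0, 25.0, 25.0), PARAMS)   # start with regulator only
print("miRNA-rich start     -> regulator level %.2f" % mirna_rich[2])
print("regulator-rich start -> regulator level %.2f" % regulator_rich[2])
```

With these values the two initial conditions relax to two different steady states, one with the regulator essentially silenced and one with the regulator abundant, which is the bistable switch referred to above.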


E. Supercoilingomics: supercoiling as a physical epigenetic 
mark, and its role in the initiation and maintenance of 
epigenetic marks 


DNA supercoiling was first properly described by Jerome Vinograd and colleagues
(Vinograd et al., 1965), and it took just a few more years for James Wang to discover
the first enzyme able to relax these topological constraints in vivo (Wang, 1971).
The topological state of DNA is given by its linking number (Lk), defined as the
number of times that a strand of DNA winds in the right-handed direction around the
helix axis when the axis is straight (or constrained to lie in a plane for a circular
molecule). This integer is the sum of two geometrical parameters: twist (or twisting
number Tw, a measure of the helical winding of the DNA strands around each other,
hence representing a "1D" deformation along the axis) and writhe (or writhing number
Wr, a measure of the 3D coiling of the axis of the double helix). The partitioning
between Tw and Wr for a given Lk is determined by the free energy of DNA (itself
dependent on ionic conditions) and by DNA/protein interactions that locally impose
some particular DNA torsion and/or writhe. While structural proteins can only alter
the partition between Tw and Wr, enzymes such as topoisomerases or gyrases can alter
Lk by catalyzing the cleavage of one or both strands of DNA, followed by the passage
of a segment of DNA through this break and the resealing of the DNA break (Wang,
2002).
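In compact form, the decomposition described above reads as follows. The first line restates the text; the relaxed linking number Lk_0, the helical repeat h (about 10.5 bp per turn for B-DNA in solution) and the supercoiling density sigma are standard conventions added here for orientation and are not defined in the text itself.

```latex
\begin{align}
  Lk &= Tw + Wr, \\
  \Delta Lk &= Lk - Lk_0 = \Delta Tw + \Delta Wr,
      \qquad Lk_0 \simeq \frac{N}{h}, \\
  \sigma &= \frac{\Delta Lk}{Lk_0}.
\end{align}
```

Here N is the number of base pairs in the topological domain. Structural proteins redistribute a fixed Delta Lk between Delta Tw and Delta Wr, whereas topoisomerases change Delta Lk itself; negatively supercoiled DNA corresponds to sigma < 0.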


In most living organisms, DNA is negatively supercoiled, which prepares DNA for
processes requiring separation of the DNA strands, such as replication or
transcription. In eukaryotes, this negative supercoiling is constrained within the
nucleosome, so that its removal will simultaneously favor the access to and the
melting of previously occluded DNA, therefore facilitating transcription initiation.
The distribution of nucleosomes, and notably the nucleosome repeat length (NRL),
appears then as an important feature to propagate through mitosis, partly for
topological reasons. Regarding the elongation step, as DNA is screwing through the
polymerase during transcription, the negative supercoiling generated behind the
enzyme can propagate through the chromatin fiber and trigger local DNA alterations
that have been proposed to serve as a regulatory signal for molecular partners
(Belotserkovskii et al., 2013; Kouzine et al., 2004, 2008; Liu et al., 2006).
Therefore supercoiling would act as a transient mechanotransducer as well as a
physical epigenetic mark. Moreover, nucleosome conformational changes might help to
smooth the elongation process by buffering some topological constraint (Bancaud
et al., 2007; Recouvreux et al., 2011; Vlijm et al., 2015) and facilitate H2A/H2B
dimer loss in front of the polymerase (Sheinin et al., 2013). It remains to be seen
how much the structural differences provided by histone variants (Shaytan et al.,
2015) would help to build "elongation friendly" regions that could be transmitted
through cell division.

Supercoiling of DNA has recently been proposed as a true physical epigenetic mark,
entering the family of "omics" data one should consider to get a comprehensive
genome-wide epigenetic landscape of a cell at a given state of its development
(Lavelle, 2014). Indeed, genome-wide maps of DNA supercoiling states have been
generated (Bermudez et al., 2010; Joshi et al., 2010; Kouzine et al., 2013a,b;
Naughton et al., 2013; Teves and Henikoff, 2014), which add to existing predicted
maps of DNA melting (Liu et al., 2007) or G-quartet motifs (Du et al., 2009; Maizels
and Gray, 2013). The emerging picture is that supercoiling is associated with the
structuring of chromatin topological domains, which largely overlap with TADs
(Naughton et al., 2013).


DNA supercoiling is a physical epigenetic mark because it may change the affinity of
the underlying DNA sequence for specific transcription factors (Travers and
Muskhelishvili, 2007). Supercoiling may also silence a whole topological domain by
recruiting transcription factors (TFs), which in turn may recruit silencing enzymes,
e.g. Suv39h, which eventually deposits H3K9me3 epigenetic marks (Bulut-Karslioglu
et al., 2012). Note that in this case, transcription factors are used to repress
rather than to activate gene expression. This mechanism relies on DNA allostery,
i.e. the change of DNA affinity for some transcription factors that is induced by
supercoiling (Lesne et al., 2015).


But how may supercoiling be initiated and inherited? The distribution of
topoisomerases and structural proteins such as condensins should help in transmitting
some domain structure and topological states through cell division (Aragon et al.,
2013; Hirano, 2014).


The nucleosome repeat length (NRL, see Sec. II.A)
might also be a key control parameter. We first note
that the twist rate of the DNA double helix in a given
topological domain is a function of (i) remodeling
activity, notably through active nucleosome removal
(Padinhateeri and Marko 2011), and (ii) topoisomerase
activity. Importantly, both these ATP-consuming mechanisms
are under active control of the cell metabolism. Moreover,
both remodeling and topoisomerase activities regulate
the value of the average NRL of a given topological
domain. Therefore the average NRL over some genomic
domain appears to be a physical epigenetic mark of this
domain, and the transmission of this average NRL through
mitosis would transmit the twist rate of the domain.
Interestingly, the recently observed spreading mode of
histone PTMs over transcription cycles (Terweij and van
Leeuwen 2013) might explain the spreading and
maintenance of the NRL on epigenetic domains. In support
of this hypothesis, active remodeling processes achieved
by ATP-consuming remodeling factors - and crucially
through active nucleosome removal - have been shown
to be essential for driving biologically relevant
nucleosome positioning (Padinhateeri and Marko 2011), thus
fixing the average NRL. Challenging genome-wide studies
are needed to further correlate supercoiling maps with
cell differentiation states.
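
As a rough, back-of-the-envelope illustration of why the
average NRL and the torsional state of a domain are linked,
one may write the following estimate. It is not taken from
the works cited above and rests on stated assumptions: a
topologically closed domain of L base pairs carrying N
nucleosomes, a relaxed helical repeat h of about 10.5 bp,
and the commonly quoted figure of about one negative unit
of linking number constrained per nucleosome.

```latex
\begin{align}
\mathrm{NRL} &= \frac{L}{N}, \qquad Lk_0 = \frac{L}{h}, \\
\sigma_{\mathrm{c}} &\simeq -\frac{N}{Lk_0} = -\frac{h}{\mathrm{NRL}}, \\
\sigma_{\mathrm{free}} &= \sigma - \sigma_{\mathrm{c}}
  = \frac{Lk - Lk_0}{Lk_0} + \frac{h}{\mathrm{NRL}}.
\end{align}
```

Under these assumptions, an NRL of 200 bp corresponds to a
constrained superhelical density of about -0.05. At fixed
Lk (no topoisomerase action), removing nucleosomes increases
the NRL and shifts the unconstrained density toward more
negative values, which must be absorbed as twist or writhe;
topoisomerases act instead by changing Lk itself.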


VI. CONCLUSION AND PERSPECTIVES 

By putting the rich and diverse biological literature
under the new light of a physical approach, the emerging
picture is that a limited set of general physical rules
plays a key role in the epigenetic regulation of gene
expression. Processes at work are diffusion-limited and
involve a small number of molecules, which precludes simple
approaches in terms of concentrations. Instead, multi-scale
approaches articulating different models at different
levels of organization are to be developed.

Mainly, epigenetics displays an intricate interplay of
physical mechanisms and specific biological entities,
devised in the course of evolution to achieve an exquisitely
coordinated and adaptable regulation of transcriptional
activity. Our review demonstrates the need to take into
account both aspects, within a dialogue between physics and
biology, theory and living-cell experiments. Importantly,
theoretical models are now amenable to experimental
verification, thanks to the many technologies that are
available to that end. It is worth noting that the field of
synthetic chromatin biology (Keung et al. 2015), which makes
it possible to modify and manipulate epigenetic states, is a
promising new avenue for putting theories to the test.

Significant and challenging issues remain: 

(i) coupling nuclear architecture and epigenetic marking:
understanding the interplay between 3D nucleus
architecture and 1D epigenetic marking (including barrier
positions);

(ii) physical signals that stimulate epigenetic marking:
coupling nucleus mechanical deformations to epigenetic
marking. Cells dramatically change their shape and
mechanics during development by integrating physicochemical
signals from the local microenvironment (morphogen
gradients, cell-cell contact, adhesion to the extracellular
matrix) to generate lineage-specific gene expression (Engler
et al. 2006). Recent studies have begun to uncover
the mechanisms by which these signals are integrated
into the 3D spatiotemporal organization and epigenetic
state of the nucleus and impact cell fate decisions (Bellas
and Chen 2014; Ramdas and Shivashankar 2015;
Shivashankar 2011). Further understanding of these
transduction mechanisms is a challenging perspective;

(iii) equilibrium vs nonequilibrium physics: An
implicit - yet overlooked - assumption of the models of
bistability introduced by Sneppen and coworkers in the
context of epigenetics (Dodd et al. 2007; Micheelsen et al.
2010) is that the system is open and far from
equilibrium. The nonequilibrium nature of their models lies
in the asymmetry of the transitions: the recruitment of
an unmarked nucleosome by a marked nucleosome does
not affect the latter, in strong contrast with an
equilibrium transition between two species. Indeed,
epigenetic marks undergo permanent recycling and
biochemical transformations, so that epigenetic marks turn
out to be steady states and not equilibrium states.
Therefore, it is of primary importance to identify,
characterize and model the various active physical
mechanisms that are at work in the initiation and
maintenance of epigenetic marks. In particular, it is
crucial to identify the active (ATP-dependent) mechanisms
that maintain epigenetic marks, for instance: metabolism,
transcription, replication, ionic pumps at the cell
membrane. In our opinion, this is a very challenging and
timely topic for biology-oriented physicists.
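
The asymmetry discussed in item (iii) can be made concrete
with a minimal sketch, written in the spirit of the
three-state nucleosome models of Dodd et al. (2007) but not
a reproduction of their code. The function names, the
parameter values and the simplified noise rule below are
assumptions made for illustration only.

```python
import random

# Three nucleosome states (labels are illustrative):
# methylated, unmodified, acetylated.
M, U, A = -1, 0, 1


def step_toward(state, target):
    """Move `state` one step toward `target` along M <-> U <-> A."""
    if state < target:
        return state + 1
    if state > target:
        return state - 1
    return state


def simulate(n=60, alpha=0.9, steps=20000, seed=0, init=M):
    """Return the final list of nucleosome states.

    alpha is the probability that an update is a recruited
    conversion; otherwise a noisy conversion is attempted.
    """
    rng = random.Random(seed)
    s = [init] * n
    for _ in range(steps):
        i = rng.randrange(n)
        if rng.random() < alpha:
            # Recruited conversion: a marked nucleosome j pulls
            # nucleosome i one step toward its own state.
            # s[j] itself is never modified: this one-way action
            # is the asymmetry that breaks detailed balance.
            j = rng.randrange(n)
            if s[j] != U:
                s[i] = step_toward(s[i], s[j])
        else:
            # Noisy conversion (simplified assumption): one step
            # toward a randomly chosen marked state.
            s[i] = step_toward(s[i], rng.choice((M, A)))
    return s
```

For instance, `simulate(alpha=0.9).count(M)` gives the number
of methylated nucleosomes at the end of one run; repeating
the run with different seeds and values of `alpha` is one way
to explore how the balance between recruitment and noise
affects the persistence of a mark.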